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Forgotten foot soldiers of science 


The story of CRISPR-Cas9 gene editing has tended to focus on a few key players. But, as with any 
area of basic research, it takes a small army of talented researchers to make a discovery. 


the credit.” Harry Truman supposedly said that, and as the US 
president who dropped the atomic bomb, he had more reason 
than most to be amazed. 

Science is a collective effort, but human nature — including that 
of people who sit on grant committees and decide prizes — tends to 
select and reward individuals. Hence the popular parlour game among 
scientists: speculating on who will get the Nobel prize for impressive 
discoveries. 

They don't come much more impressive than a molecular-biology 
technique that has swept through laboratories in the past few years, 
accelerating genetic research and tantalizing the public with its poten- 
tial to change how genetic diseases are treated. Asa result, the scientists 
who helped establish the technique — which rewrites DNA using a 
bacterial immune system called CRISPR-Cas9 — have received acco- 
lades and awards, and ample coverage in the media. 

But the researchers who did much of the work — the graduate 
students and postdocs who carried out the experiments — are rarely 
mentioned. In this week’s Nature, we take a look at a small sample of 
those researchers and how their experience with CRISPR has affected 
their careers (see page 342). 

Our article is an attempt to grant these junior investigators a little 
of the limelight, but it also shows how difficult it is to do so: owing 
to space limitations, many young scientists who made important 
contributions to the field were left out of the article. (It is a decision 
that journalists face all the time, but it was made all the more painful 
in this case.) 

The fairest approach would perhaps have been to replace the article 
with a simple two-page list, in fine print, of key researchers. Even then, 
there would be missing entries. The development of CRISPR gene 
editing is basic research at its finest: a handful of researchers initially 
noticed an oddity in bacterial genomes and decided to investigate, and 
this small squad swelled into a global army of scientists investigating 
every aspect of the system and how it can best be applied to treat dis- 
eases and engineer transgenic plants and animals. It is impossible to 
adequately assign credit in any one article or documentary — or on 
one awards podium. And that means postdoctoral researchers and 
graduate students are largely left unrecognized. 

The focus on senior figures is rational. They are more likely to have 
spent many years pioneering a field, and principal investigators set the 
tone and direction of their laboratories. 

There are also benefits to staying out of the public eye, particu- 
larly for young investigators. The barrage of interview and speaking 
requests takes time away from the focus needed to launch a laboratory. 
And becoming famous for a particular advance, particularly a techno- 
logical one, can pigeonhole a young investigator at a time when they 
would rather spread their wings and visit new fields. 

But there is also a price. Aside from the ego boost, awards 


¢ Cc E is amazing what you can accomplish if you do not care who gets 


— particularly in the form of grants — bolster a career. Scientific fame 

that spreads outside one specific community has a greater chance of 

influencing review panels. This is particularly important for young 

investigators who are starting their careers in what suddenly becomes 

a highly competitive area. Senior investigators must ensure that they 

promote their best and brightest junior colleagues both inside and 
outside their field. 


“Postdoctoral The mentors featured in our article show 
researchers that this can be done. They work hard to 
and graduate foster an atmosphere of independence and 
students are fearless exploration in their laboratories. 
largely left They credit their graduate students and 


postdocs during talks, offer them oppor- 
tunities to present at major conferences, 
and actively promote their careers — even long after their junior 
colleagues have moved on to other labs. 

To achieve notoriety beyond scientific circles is a greater challenge, 
but still possible. Reporters often seek comment from celebrity scien- 
tists: the better known and more established, the better. In this case, 
reputation serves as a proxy for quality: given limited space, journalists 
seek to get the most credible comment that they can squeeze in. But 
good reporters sprinkle their articles with comments from younger 
investigators as well. Senior researchers can encourage this practice, 
and point journalists in the right direction. 

Assigning scientific credit will never be entirely fair. But with con- 
sideration and support, wouldn't it be amazing if young investigators 
got a taste of the attention — and the career boost — they deserve? = 


unrecognized.” 


Virtual taxonomy 


People everywhere are catching Pokémon. Can 
they also catch real new species? 


illions of people have spent the past week walking around. 
M Ostensibly, they are playing the online game Pokémon Go 

and hunting for critters in an ‘augmented reality’ world. But 
as gamers wander with their smartphones — through parks and neigh- 
bourhoods, and onto the occasional high-speed railway line — they 
are spotting other wildlife, too. 

Scientists and conservationists have been quick to capitalize on 
the rare potential to reach a portion of the public usually hunched 
over consoles in darkened rooms, and have been encouraging 
Pokémon hunters to snap and share online images of the real-life 
creatures they find. The question has even been asked: how long 
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before the game prompts the discovery of a new species? 

It’s not out of the question: success is 90% perspiration after all, and 
millions of gamers peering around corners and under bushes across 
the world can create a very sweaty exercise indeed. By definition, each 
Pokémon hunter almost certainly holds a high-definition camera 
in their hands. And there is a precedent: earlier this year, scientists 
reported Arulenus miae, a new species of pygmy devil grasshopper, 
identified in the Philippines after a researcher saw an unfamiliar insect 
ina photo on Facebook (J. Skejo and J. H. S. Caballero Zootaxa 4067, 
383-393; 2016). 

But Pokémon Go players beware. It is one thing to conquer a world 
of imaginary magical creatures with names like Eevee and Pidgey, 
and quite another to tangle with the historical complexity of the 
International Code of Zoological Nomenclature. So, say you do 
manage to snap a picture of something previously unknown to science 
— what then? Let Nature be your guide. 

First, the good news — the Code (we'll call it that from now on to 
save on Twitter characters) is now officially with the times, and no 
longer reliant on the dead trees that were so popular before you were 
born. Despite grumbles from traditionalists, in 2012 the International 
Commission on Zoological Nomenclature, which hosts the Code, 
agreed to embrace online-only media. In doing so, it relaxed its 
rule that species could be officially named only in printed academic 
journals. 

Now, the bad news — if your picture of an unusual butterfly or 
bird or hippopotamus does look to a friendly online biologist like a 
new species, then you'll probably have to go back and catch the beast. 
(Whisper it, but you might even have to kill it.) 

If you think that sounds too difficult, then some (but not many) 
zoologists agree with you. What's ina name? To most biologists, it’s still 
a fresh corpse — as two scientists discovered last year when they tried 
to identify and name a new species of bombyliid (bee fly) that they 
had caught and photographed in South Africa. Convention demands a 
reference sample called a holotype in exchange for a formal taxonomic 
listing, but — disaster! — the fly escaped before it could be killed 
and immortalized. Unwilling to chase it down again, the cheeky duo 
chanced their arm and showed the photographs alone as evidence, in 


the stead of their long-flown bee fly (S. A. Marshall and N. L. Evenhuis 
ZooKeys 525, 117-127; 2015). 

“Digital collectors” are fast outnumbering specimen collectors, the 
scientists warned, and new conservation rules are making it harder 
to collect and transport real species samples. The biodiversity com- 
munity is going to have to move with the times, they said, and “adapt 
to growing numbers of new taxa recognized without benefit of dead, 

preserved type specimens’. High-resolution 


“What’s ina photos, they added, rather unconvincingly, 
name? To most “can often provide enough information for a 
biologists, it’s proper description”. 

stillafresh But rules is rules — and sometimes they 
corpse. e are in place for a good reason. This month, 


zoologists from around the world came down 
on the idea like a tonne of pavement ants. A photo can’t be prodded 
and turned over. It can’t be dug out of storage and have its back legs 
gently stretched. It can’t have its genitals re-examined. In other words, 
it can't have science done with it — not proper, verifiable, reproduc- 
ible and falsifiable science. “The spirit of the International Code of 
Zoological Nomenclature is willfully violated by a description based 
only on a photograph,” their report concluded (D. S. Amorim et al. 
Zootaxa 4137, 121-128; 2016) 

Still, hippos don’t run that fast and can’t be that hard to catch. Let’s 
say our brave Pokémon Go player really does have a new species on 
their hands, and not just a high-resolution photo. What they want 
to know, of course, is can they name it after their mum? Here, the 
zoologists are remarkably relaxed. Unlike planets, chemical elements 
and human diseases, almost anything goes with wildlife. (Hence Dick 
Cheney, George W. Bush and Donald Rumsfeld each share their name 
with a slime-mould beetle.) Be prudent, though — your idol or hero 
may not appreciate the attention. 

The ecologist Cheng-Bin Wang last month poetically named a new 
beetle after Chinese President Xi Jin-Ping to honour “his leadership 
making our motherland stronger and stronger” (C.-B. Wang Zootaxa 
4126, 287-294; 2016). Chinese censors responded with some aug- 
mented reality of their own, attempting to scrub all mentions of the 
beetle from the Internet. = 


Two wheels good 


As cycling enjoys a popularity surge, science 
can help riders to get faster and stay safer. 


number of amateur cyclists are stretching their legs on two 

wheels, and stretching their Lycra as they do so. The resur- 
gence in the popularity of cycling in many countries is a twenty-first- 
century success story, and one that offers some hope as the world 
searches for sustainable cities and transport systems. 

Still, as we discuss in a News Feature on page 338, despite advances 
in materials and technology (and an obsession with shedding weight), 
the geometry of a bicycle frame this year looks, to a physicist at least, 
pretty much the same as last year, and as it almost always has. 

Where is the innovation? The Ford Motor Company has patented 
an inflatable bicycle frame, although (not surprisingly) the invention 
tries more to tackle the problem of how a bicycle can be transported, 
say in the boot of a car, than how it can be ridden. 

For raw speed, enthusiasts should head not to the peaks of France, 
but to Battle Mountain, Nevada, this September. On a 200-metre run 
of State Route 305 (closed to traffic), cyclists will compete to break 
the human-powered land-speed record. The annual event draws 
engineers and riders from across the globe, to roll machines that look 


A s the Tour de France enters Paris this weekend, an increasing 
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from a distance like eggs along one of the world’s straightest, flattest 
and smoothest roads. The current record is just under 135 kilometres 
per hour. 

The frames of these speed machines do look different from your 
standard racer. For better aerodynamics, they are recumbent bicycles 
(prone bikes, on which the rider lies chest down, have also been built). 
And this is where the physics of cycling does have an important role — 
some teams are experimenting with rear-wheel steering, which offers 
a shorter, and so more efficient, chain. 

Rear-wheel steering is less stable than front-wheel. Yet, as most 
people who have got upright on two wheels will recognize, bicycles 
in general are more stable than they look to non-riders and learners. 
Given a push, most will even stay upright without a rider. Sometimes, 
it takes a person trying to control a bike to make it fall over. 

The majority of times that a rider and bike do fall over and cause 
injury, there is no one else involved. And often the problems come 
at the start or end of the journey, or when a rider is forced to stop en 
route — especially for older people. A paper this year in Safety Science 
found that 20% of injuries to older cyclists come when the riders are 
trying to get on or off (R. Dubbeldam et al. Safety Sci. http://doi.org/ 
bmgf; 2016). This is where science can help. The study analysed the 
mounting and dismounting strategies of cyclists young and old, and 
saw that older people — perhaps because of how they were taught 
— tend to begin and brake with one foot hopping along the ground 
as the wheels turn, which is less stable than starting or stopping with 
both feet on the pedals. You never forget how to ride a bike. But some 
people need a refresher. m 
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observed plants through his microscope some 350 years 

ago. They have adapted and adopted new technologies in 
molecular biology, chemistry and microscopy. But unlike research- 
ers in genomics and other branches of biology, they have yet to fully 
embrace computer science. 

The days of staring at microscope images to explain cell processes, 
as Hooke did, are over. Microscope data today are diverse and highly 
variable, and contain complex cellular patterns that are often invisible to 
the human observer. Only computers and digital vision can find them, 
and so identify cell and molecular dynamics. Computer vision can help, 
for example, to map spatial organization of subcellular components and 
understand how they work together to generate cell behaviour. 

Digital tools already help cell biologists to carry out routine tasks: 
measuring cell shape or detecting and tracking 
intracellular components. More complex analysis 
should be a full-time speciality, but cell biologists 
do not treat it as such. Rather than embrace a truly 
interdisciplinary approach, with experts with 
different skills working together, cell biology has 
opted for hybridization. Academic departments 
and principal investigators seek individuals 
who can do both the biology and the computer 
analysis, and so encourage students and postdocs 
to train in both fields. For young scientists, this 
comes ata cost. Doing top-notch biology or com- 
puter science requires long training, expertise and 
attention. Either is a full-time job — aiming to 
train experts in both produces Jacks of all trades, 
masters of none. 

My postdoctoral research is in computational 
cell dynamics, and I would like to continue in this field as an inde- 
pendent investigator. I recently discussed my academic future with 
several established principal investigators and department chairs from 
computer science, engineering and biomedical sciences: they were 
concerned about my inherent dependence on experimental partners 
to produce data, and warned it would undermine my ability to secure 
grants. To make myself more attractive to employers and funders, they 
suggested, I should learn to perform simple cell-biology experiments. 

A colleague faced a similar reaction when she proposed to apply 
computer vision to study cell migration in 3D environments. One 
reviewer commented that analysis of other people’s data was a weak- 
ness. My colleague decided to heed the advice and follow the hybrid 
path. What a waste! Instead of using her unique expertise, she now 
spends valuable time learning basic bench science that repeats what 
others already do and will yield only standard data. 

Funders laud the importance of team science, but for cell biology it 
seems those teams must be made up of otherwise independent scien- 
tists. True collaborative science should follow industry by including 


Cs biologists have come a long way since Robert Hooke first 


COLLABORATIVE 
SCIENCE SHOULD 


INCLUDE 
SPECIALISTS 


WHO CAN'T DO 
SCIENCE WITHOUT 


THE TEAM. 


Cell biologists should 
specialize, not hybridize 


Dry cell biologists, who bridge computer science and cell biology, should have 
a pivotal role in driving effective team science, says Assaf Zaritsky. 


specialists who simply can't do science without the team. 

When computer scientists work on collaborative projects, too often 
we are expected only to provide a service: to develop custom tools 
and crunch data collected by the experimentalists. In this mode, the 
computational collaborator is required only to solve a given technical 
challenge. Accordingly, the emerging field of bioimage informatics 
develops algorithms and tools to analyse biological images at high vol- 
umes and throughput, but its main motivation is to outperform other 
algorithms on common problems such as registration, detection and 
segmentation. It demands no deep understanding of the underlying 
biological aspects, which limits what it can achieve. 

The role of a quantitative collaborator can go further. Discovery of 
complex dynamic patterns requires knowledge of the biological pro- 
cess, the experimental possibilities and the type of information that 
can be extracted from data. Such insight into 
where to look, what to look for and how to look 
for it leads to more productive collaboration, in 
which quantitative scientists also drive biological 
inquiry. They adopt the motivation, terminology 
and intellectual framework of biology. This gives 
the research a more solid quantitative basis — just 
as genomics and computer science have combined 
to develop bioinformatics. 

This type of partnership has also produced 
major innovations in evolutionary biology and 
proteomics. It will not happen in cell biology 
while we insist on retraining computer scientists 
to do undergraduate biology. 

To promote better collaboration, we need to 
nurture quantitative scientists with scientific 
motivations in cell biology. We can call them dry 
cell biologists. These scientists work at the boundaries of quantitative 
disciplines and cell biology. They identify scientific problems, steer (but 
do not perform) experiments, import, adapt and apply quantitative 
tools, and interpret the data to conceive testable numerical predictions. 

Importantly, dry cell biology is different from theory or in silico 
biology. Theoreticians build mathematical models and simulations to 
explain an observation. Dry cell biologists, by contrast, extract informa- 
tion from data to learn about biological processes. 

There are considerably more biologists who can generate visual data 
than can process and interpret it effectively. Thus, dry cell biologists 
will fill in a necessary piece of this puzzle, working hand in hand with 
conventional cell biologists to drive the subject forward. 

In some disciplines, such as structural biology, the value of dry 
research is taken for granted. It is time for cell biology to embrace us. m 


Assaf Zaritsky is a postdoc investigating computational cell dynamics 
at the University of Texas Southwestern Medical Center, Dallas. 
e-mail: assafzar@gmail.com 
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Selections from the 
scientific literature 


RESEARCH HIGHLIGHTS 


NUCLEAR FORENSICS 


Reconstruction of 
1945 nuclear test 


By measuring concentrations 
of stable isotopes in bomb 
debris, researchers have 
worked out the details of 
a nuclear test performed 
70 years ago. 
Scientists have long debated 
the efficiency and yield of 
the first atomic bomb, called 
Trinity, which was detonated 
in 1945 in New Mexico. Susan 
Hanson and her colleagues 
at the Los Alamos National 
Laboratory in New Mexico 
analysed bomb debris and 
measured changes in both 
the isotope ratios and the 
total levels of molybdenum 
—astable decay product of 
zirconium isotopes, which 
form asa result of nuclear 
detonation and are short-lived. 
The team then calculated the 
original concentrations of the 
zirconium isotopes and came 
up with a yield for the Trinity 
detonation that was in line 
with that officially reported. 
This approach could 
prove useful in ongoing 
nuclear non-proliferation 
and verification efforts, the 
authors say. 
Proc. Natl Acad. Sci. USA 
http://doi.org/bmjk (2016) 


Brain can retrieve 
baby memories 


‘Lost’ infant memories can be 
reinstated later in life, thanks 
to specific mechanisms in 
the hippocampus, the brain’s 
memory centre. 

Humans and other animals 
are often unable to recall 
early-life events, but these 
experiences can still affect 
the brain and behaviour 
later in life. To figure out 
how, Cristina Alberini at 
New York University in New 


Shark-tracking study shapes marine park 


Monitoring the movements of sharks can help 
researchers to advise on the areas best served by 


marine reserves. 


In the Seychelles in the Indian Ocean, a 
proposed marine protected area has been 
designed to safeguard mainly turtles and coral 
reefs. To see how well it might protect sharks, 
James Lea at the Marine Biological Association 
of the United Kingdom in Plymouth and 
his colleagues used acoustic transmitters 


over a three-and-a-half-year period in and 
around an area earmarked for a reserve in the 


Seychelles. They also tracked 25 turtles. When 


smaller one. 


to track 116 sharks (five species, including area. 


Carcharhinus melanopterus; pictured) 


York City and her colleagues 
administered electric shocks 
to the feet of infant rats and 
showed that the animals 
seemed to lose this memory 
within 24 hours. However, 
when presented with 
contextual reminders and 
another footshock several days 
later, the rats behaved as if they 
remembered the initial shock. 
The researchers pinpointed 
specific mechanisms in the 
hippocampus — including 

a shift in the relative levels 

of two forms ofa receptor 

for a molecule called NUDA 
— that are involved in the 
formation and storage of 
early-life memories. 
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The results suggest that the 
hippocampus goes through 
a critical period of rapid 
development during infancy. 
Nature Neurosci. http://dx.doi. 
org/10.1038/nn.4348 (2016) 


| GENOMICS 
Mitochondrial 
mismatch in mice 


Mouse embryos containing 
DNA from three animals 
may not survive gestation, 
and those that do could go 
on to develop reproductive 
problems — a finding with 
potential implications fora 
proposed human therapy. 


© 2016 Macmillan Publishers Limited. All rights reserved. 


they compared the areas that these animals 
favoured with two proposed reserve sizes, they 
found that the larger reserve covered about 
34% more of the animals’ movements than the 


In response to the research, the Seychelles 
government agreed to use the larger protected 


Proc. R. Soc. B 283, 20160717 (2016) 


Energy-producing 
cell organelles called 
mitochondria carry their 
own DNA, which when 
mutated can cause disease. 
Mitochondrial-replacement 
therapy is being developed 
to prevent the inheritance of 
such diseases and involves the 
replacement of the organelles 
with donor ones, resulting 
in embryos with DNA from 
three people. To look for 
potential incompatibilities 
between the mitochondrial 
and nuclear genomes, a team 
led by Shoukhrat Mitalipov 
at the Oregon Health and 
Science University in Portland 
replaced the mitochondria 
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in fertilized eggs from one 
subspecies of mouse with 
those from another subspecies. 

Embryos containing 
nuclear DNA from Mus mus 
domesticus and mitochondrial 
DNA (mtDNA) from Mus 
mus musculus produced 
healthy female progeny and 
males with reduced fertility. 
However, most of the embryos 
containing nuclear DNA from 
M. m. musculus and mtDNA 
from M. m. domesticus were 
aborted or stillborn. 

Should mitochondrial- 
replacement therapy reach the 
clinic, the authors recommend 
selecting donors with similar 
mtDNA to recipients. 

Cell Metab. http://doi.org/bmn5 
(2016) 


Shape-shifters 
made with a snap 


Materials that include a 
modular system of hinges have 
been used to build shape- 
shifting structures. 

Current methods can make 
reconfigurable structures with 
only a limited number of stable 
forms. To make such materials 
more versatile, Lorenzo 
Valdevit of the University 
of California, Irvine, and 
his colleagues created a 
triangular Teflon structure 
several centimetres across, 
with hinges that allow it to 
snap into an open or closed 
configuration. By linking 
these subunits together, the 
team built 3D structures with 
multiple stable positions 
(pictured). 

The approach could be used 
with a wide range of materials 
to make tools that have 
different forms and functions, 
the authors say. 

Adv. Mater. http://doi.org/f3qmr2 
(2016) 


Microscope can 
see under the sea 


An underwater microscope 
allows researchers to capture 
behaviours of corals and other 
marine organisms in their 
native habitats. 

Andrew Mullen at the 
Scripps Institution of 
Oceanography in La Jolla, 
California, and his colleagues 
designed a system (pictured 
left) that divers can use to take 
almost micrometre-resolution 
images (Stylophora coral 
pictured right) at depths of 
up to 30 metres. It lets marine 
biologists observe biological 
processes on the sea floor 
that cannot be easily replicated 
inalab. 

The authors used their 
instrument to study 
millimetre-sized coral polyps 
off the coast of Maui, Hawaii, 
in 2015, during the first mass 
coral-bleaching event recorded 
for the main Hawaiian 
islands. They found that algae 
colonized bleaching corals in 
specific patterns. Near Eilat 
in Israel, they discovered that 
coral polyps connect to each 
other through their openings, 
probably to share chemicals. 
Nature Commun. 7, 12093 (2016) 


NANOTECHNOLOGY 


Chlorine atom 
arrays store data 


A device that uses 
arrangements of atoms to 
encode and store information 
has orders of magnitude more 
capacity than current hard 
drives. 


RESEARCH HIGHLIGHTS Mii Saiaa¢ 


Sander Otte at Delft 


University of Technology in the 
Netherlands and his colleagues 
assembled arrays of chlorine 
atoms on a nanometre- 

sized copper surface. They 
used a scanning tunnelling 
microscope to manipulate 

the atoms and vacant spaces 
on the surface, creating many 
different arrangements that 
encode information. The 
researchers used these arrays 
to builda 1-kilobyte rewritable 
data-storage device with an 
information density as high 

as 78 terabits per square 
centimetre. 

If the device were scaled up, 
it could store the entire US 
Library of Congress in a cube 
just 100 micrometres wide, 
according to the authors. 
Nature Nanotech. http://dx.doi. 
org/10.1038/nnano.2016.131 
(2016) 


Bees like their 
pollen sweet 


Bees can taste the pollen they 
collect, and favour the sweet 
kind. 

Felicity Muth and her 
colleagues at the University 
of Nevada in Reno offered 
bumblebees (Bombus 
impatiens) an artificial flower 
containing one of three types 
of pollen — sweet, bitter or 
unflavoured. Bees presented 
with the sweet flower spent 
longer collecting pollen than 
did those offered the other 
types. When the bees were 
presented with two more 
flowers, one of which was the 
same colour as the first, few 
bees opted to change colour 


if their first flower had been 
sweet or unflavoured. 

Many plants rely on bees 
for widespread pollination, 
so plants may have evolved 
pollen just sweet enough to 
keep bees coming back for 
more, but not so sweet that one 
bee takes it all, the authors say. 
Biol. Lett. 12, 20160356 (2016) 


ALS gene linked 
to autoimmunity 


A gene that is often 
mutated in people with a 
neurodegenerative disease 
may help to keep immune 
responses in check. 

Mutations in the C9ORF72 
gene are commonly found in 
people with motor neuron 
disease, also known as 
amyotrophic lateral sclerosis 
(ALS), as well as in people 
with a form of dementia. To 
determine the function of 
the COORF72 protein, Kevin 
Eggan at Harvard University 
in Cambridge, Massachusetts, 
and his colleagues introduced 
mutations that diminished or 
eliminated the function of the 
equivalent protein in mice. The 
mutant mice had autoimmune 
disease, over-produced 
inflammatory molecules and 
died prematurely. 

The authors conclude 
that COORF72 has an 
important role in controlling 
inflammation and 
autoimmunity. 

Sci. Transl. Med. 8, 347ra93 (2016) 


> NATURE.COM 

For the latest research published by 
Nature visit: 
www.nature.com/latestresearch 


21 JULY 2016 | VOL 535 | NATURE | 327 


© 2016 Macmillan Publishers Limited. All rights reserved. 


SEVEN DA 


New UK government 
Theresa May became UK 
prime minister on 13 July 

in the wake of the country’s 
Brexit referendum, and quickly 
proceeded to appoint a new 
government. A newly created 
Department for Business, 
Energy and Industrial Strategy 
(BEIS), led by former science 
minister Greg Clark, will take 
on responsibilities that include 
science and energy policy; 

the Department of Energy 
and Climate Change will be 
disbanded. Jo Johnson stays 
on as universities and science 
minister, with his brief split 
between the Department of 
Education and BEIS. See page 
331 for more. 


African institute 
The Wellcome Trust and the 
Howard Hughes Medical 
Institute (HHMI) announced 
on 18 July that they will 
merge their South African 
medical research institutes. 
The HHMI-backed KwaZulu- 
Natal Research Institute 

for TB-HIV (K-RITH) in 
Durban will join the Africa 
Centre for Population Health 
in Mtubatuba, backed by 
Wellcome, to create the Africa 
Health Research Institute. 
Supported by a US$83-million 
grant from both funders, the 
institute will link K-RITH’s 
laboratory facilities with 

the Africa Centre’s detailed 
longitudinal population- 
health data to combat the 
epidemics of HIV and 
tuberculosis. 


Romania joins CERN 
On 17 July, Romania became 

a member of CERN, Europe's 
particle-physics laboratory 
near Geneva, Switzerland. 

The former socialist republic 
became a fee-paying 

candidate member state of the 


The news in brief 


Turkey turmoil prompts academic purge 


In the wake of the failed coup in Turkey on 

15 July, the Turkish Council of Higher Education 
(YOK) has told university rectors to identify 
academics and administrators with connections 
to the Giilen movement — which Turkey's 
President Recep Tayyip Erdogan considers to be 
behind the coup — and take steps to expel them. 
YOK made the statement during an emergency 
meeting of 165 university rectors on 18 July in 


organization in 2010, following 
a government decision to 
participate in large European 
infrastructures and projects 

as a way of increasing its own 
scientific strength. Romania is 
CERN’s 22nd member state. 


GM food labels 

The US House of 
Representatives has passed a 
bill that would create a national 
labelling requirement for many 
foods that contain genetically 
modified (GM) ingredients. 
The bill, passed by the House 
on 14 July and by the Senate on 
7 July, will now go to President 
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Barack Obama, who is 
expected to sign it into law. The 
requirement allows the labels 
to include links to websites 

that contain the required 
information; foods that are 
genetically altered in ways that 
could be found in nature will 
not need to be labelled. 


Navy sonar ruling 
An authorization that allowed 
the US Navy to use a type of 
sonar that might be harmful 
to marine mammals was 

in violation of the Marine 
Mammal Protection Act, 

a federal court ruled on 

15 July. A 2012 decision 

had allowed the navy to use 


Ankara. The council did not invite a further 

28 rectors to the meeting, saying that their 
universities are suspected of being pro-Giilenist. 
Some of these institutions will be taken over by 
the state, YOK said. The council also ordered all 
1,577 university deans to resign. Separately, the 
international Committee on Space Research 
cancelled its biennial conference, due to starton 
30 July in Istanbul, on security grounds. 


low-frequency active sonar for 
training, testing and routine 
operations — despite protests 
by environmental groups that 
it disrupts feeding and mating 
patterns in certain marine 
animals, including whales and 
dolphins. The navy uses the 
sonar to detect submarines. 


Chipmaker buyout 
Japanese technology 

company SoftBank will buy 
ARM Holdings, a major UK 
computer-chip maker based in 
Cambridge, for US$32 billion, 
the company announced on 

18 July. Some industry experts 


CHRIS MCGRATH/GETTY 


COURTESY OF UT GRADUATE SCHOOL OF 
BIOMEDICAL SCIENCES AT HOUSTON 


SOURCE: J. BARRINGTON-TRIMIS ET AL. (2016) 


have voiced concerns over the 
takeover, citing ARM’s role 

asa pillar of the British tech 
industry. But members of the 
new UK government said the 
deal showed that Britain was 
still attractive to international 
investors after the country’s 
vote to leave the European 
Union. ARM designs the chips 
used in most smartphones. 


PEOPLE 


Trump picks Pence 


Donald Trump, the Republican 
party’s presumptive nominee 
for US president, selected 
Indiana governor and former 
congressman Mike Pence as 
his vice-presidential running 
mate on 15 July. A religious 
conservative who could help 
Trump to shore up support 

in the Republican party, 
Pence has spoken out against 
embryonic-stem-cell research 
and prevaricated when 

asked whether he believes 

in evolution. He has also 
called the science underlying 
global warming “very mixed” 
and has a 4% rating with 

the League of Conservation 
Voters, an advocacy group 
that tracks politicians’ votes on 
environmental issues. 


Cancer pioneer dies 
Alfred Knudson (pictured), 
who proposed that cancers 

can be driven by accumulated 
mutations, died on 10 July, 
aged 93. In a seminal 1971 


paper on the epidemiology 
of retinoblastoma (a tumour 
of the retina), he found that 
people typically develop the 
tumour either as children 

or much later in life. He 
surmised that children with the 
condition inherit a defective 
copy ofa cancer-causing gene 
and develop the cancer after a 
mutation arises in the second 
copy of the gene. However, 

in the adult form, the cancer 
develops only after non- 
inherited mutations occur in 
both copies. 


Australian cabinet 


Australia’s freshly re-elected 
prime minister, Malcolm 
Turnbull, announced a new 
minister for science on 18 July, 
one of several appointments 
to his government. Former 
environment minister Greg 
Hunt, the architect of the 
governments climate policy, 
will now be in charge of 
innovation, industry and 
science. Turnbull, whose 


conservative coalition was 
re-elected with a narrow 
majority earlier this month, 
unveiled Josh Frydenberg 

as the next environment 
minister, who will also take 
responsibility for energy. Some 
climate analysts think that 
combining the portfolios could 
help to reduce the country’s 
reliance on fossil fuels. 


| FUNDING 
Columbia costs 


Columbia University in New 
York City has agreed to pay 
US$9.5 million in a settlement 
for seeking “excessive cost 
recoveries” from the US 
National Institutes of Health 
(NIH) in connection with 
hundreds of research grants, 
the US justice department 
announced on 14 July. 
Columbia is alleged to have 
repeatedly billed the NIH at a 
higher ‘on-campus rate, rather 
than a cheaper ‘off-campus’ 
rate, for research that was 
mainly done off campus, the 
settlement said. 


| RESEARCH 
Dengue dip 


The introduction of 
mosquitoes carrying a gene 
that kills their offspring has 
apparently reduced the number 
of new cases of dengue fever 

in asmall trial in Brazil. The 
incidence of dengue, which is 
caused by a virus transmitted 


E-CIGARETTES BOOST NICOTINE USE 


TREND WATCH 


Electronic cigarettes were thought 
by many to have simply replaced 
cigarettes among smokers, but an 
analysis of 5,490 schoolchildren in 
California reveals that a growing 
number of potential non- 
smokers are taking up ‘vaping’ 

— indicating a rise in nicotine use 
among teenagers (J. Barrington- 
Trimis et al. Pediatrics http:// 
doi.org/bmnvy; 2016). The study, 
published on 11 July, shows that 
in 2004, 9% of 17-18-year-olds 
smoked (conventional cigarettes), 
but by 2014, 14% were smoking 
or vaping. 


Fewer Californian teens smoke cigarettes than they did 20 years ago 
— but with the emergence of e-cigarettes, nicotine use is on the rise. 
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SEVEN DAYS | THIS WEEK | 


18-22 JULY 

The 21st International 
AIDS Conference takes 
place in Durban, South 
Africa. 

www.aids2016.0rg 


25-27 JULY 

The University of 
Aveiro in Portugal hosts 
the 7th International 
Conference 

on Advanced 
Nanomaterials. 
www.anm2016.com 


by the Aedes aegypti mosquito, 
fell by 91% (to 12 new cases, 
from 133 the previous year) 

ina district of the southern 
city of Piracicaba. Genetically 
modified A. aegypti were 
released in the district in 
April 2015. By comparison, 
incidence dropped by just 52% 
in the rest of the municipality. 
Oxitec, the biotechnology 
company in Oxford, UK, 

that developed the mosquito, 
announced the results on 

14 July. Philip McCall, an 
entomologist at the Liverpool 
School for Tropical Medicine, 
UK, says that it is possible 

that other control efforts were 
responsible for the reductions, 
but calls the trial a “step in the 
right direction” 


Greenland ice loss 


Greenland lost around 1 trillion 
tonnes of ice between 2011 

and 2014, the European Space 
Agency (ESA) revealed on 

12 July. The loss corresponds 
to an annual sea-level rise that 
is about twice the average rate 
in the previous two decades. 
An analysis of data from ESAs 
CryoSat satellite shows that 
five thinning glaciers, which 
cover less than 1% of the area 
of Greenland, were responsible 
for more than 12% of the 

total loss (M. McMillan et al. 
Geophys. Res. Lett. http://doi. 
org/bmns; 2016). 
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UK prime minister Theresa May meets astronaut Tim Peake, days after taking up her new post. 


New Brexit government 
spells shake-up for science 


Theresa May promotes a former science minister and abolishes climate-change department. 


BY DAVIDE CASTELVECCHI 


hree weeks after UK voters chose to 
‘Tiers the European Union, the coun- 
try has a new prime minister, Theresa 

May — anda revamped administration that is 
poised to change science’s place in government. 
An overall atmosphere of chaos has reigned 
since the referendum, and May’s ascendance 
has brought a measure of stability. Notably for 


researchers, she announced on 16 July that the 
universities and science minister Jo Johnson 
would continue in his role. 

But since she took charge, May has also 
shaken up the government in ways that affect 
researchers. On 15 July, she created a new 
department by merging two existing ones — the 
Department for Business, Innovation and Skills 
(BIS), which had overseen the United King- 
dom’s annual £4.7-billion (US$6.3-billion) 


research budget, and the Department of 
Energy and Climate Change (DECC). And she 
appointed Greg Clark, a former science minis- 
ter, to lead the new Department for Business, 
Energy and Industrial Strategy (BEIS) . 
Science-policy experts welcome the 
appointment of Clark. “People will be happy 
that he has played that role in the past and 
understands the importance of science and of 
the research system,” says Kieron Flanagan, 
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| NEWS IN FOCUS 


> ascience-policy researcher at the 
Alliance Manchester Business School. 

It is less clear what the aboli- 
tion of DECC will mean for climate 
research and policy, and some fear 
it will weaken the United Kingdom's 
responsibilities on climate change. 
“DECC’s disappearance raises urgent 
questions,’ said Angus MacNeil, who 
leads the cross-parliamentary Energy 
and Climate Change Committee. 

Bob Ward, a climate-change policy 
expert at the London School of Eco- 
nomics and Political Science, warns 
against jumping to the conclusion that 
May will be timid on climate action. 
“From what we know of her state- 
ments on climate change, she under- 
stands the importance of it? he says. 
“T dont think the fact that there is no 
climate change in the title necessarily 
means a reduction in commitment.’ 

A more consequential change will 
come if some of the civil servants who 
are experts on climate change are lost in the 
transition. DECC’s staff understood not only 
climate science but also its effects on regional 
weather and the need for UK infrastructure to 
adapt, says Corinne Le Quéré, director of the 
Tyndall Centre for Climate Change Research 
at the University of East Anglia in Norwich. 

May’s own recent rival for the prime 
minister position, Andrea Leadsom, is the 
new environment secretary, and May has 
moved responsibilities for higher education 
from BIS to the education department, which 
is now under Justine Greening. 

Asa result, Johnson’s brief will now be split 
between the education department and BEIS. 
Universities and schools are very different, 
notes Nick Hillman, director of the Higher 
Education Policy Institute in Oxford. But, he 
adds, the reorganization could work provided 
Johnson has “considerable autonomy” within 
the education department. Importantly, John- 
son's reappointment signals that May plans to 
push on with reforms of higher education and 
of research funding begun by her predecessor, 
David Cameron. 


STABILITY, FOR NOW 

May’s appointment of Philip Hammond 
as Chancellor of the Exchequer, a job that 
includes overseeing the state’s budget 


2 
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Former science minister Greg Clark leads a newly created department. 


including allocations to research, is expected 
to provide stability, too. “He is seen as being a 
person who doesn't make headline decisions, 
so we might have some hope of status quo at 
the very least with regard to research funding,” 
says Jenny Rohn, who chairs the UK lobby 
group Science is Vital. 

But the country still faces the uncertainty 
of Brexit. May, who became the United King- 
dom’s second female prime minister on 13 July, 
had opposed leaving the EU, like Cameron. But 
she has made it clear that under her watch, the 
United Kingdom will leave the EU, and has 
sought to differentiate herself from Cameron's 
government. She has sacked some of its key 
members, and filled some posts with pro- 
Brexit politicians, putting Boris Johnson, the 
pro-Leave former mayor of London, in charge 
of the Foreign Office, for example. She has 
also created a post for leading the negotiations 
that will determine the United Kingdom's new 
relations with the EU. 

Scientists are still in the dark about how May 
herself views science. She ran virtually no for- 

mal campaign for prime 


> NATURE.COM minister and has given 
For the latest little indication of what 
on Brexit and her policy priorities on 
science, see: research would be. Ina 
go.nature.com/29x6wid speech in Birmingham 
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on 11 July, she called for “a better 
research and development policy that 
helps firms to make the right invest- 
ment decisions” and “an energy pol- 
icy that emphasizes the reliability of 5 
supply and lower costs for users’, but 
did not go into further detail. 


HALL/REUTERS 


PRAGMATIC OPERATOR 

In her former role as home secretary, 
May’s push to reduce immigration 
and tighten up visa requirements 
sometimes put her at odds with 
university administrators. She also 
helped to push through a bill that 
enacted a blanket prohibition on so- 
called designer drugs. But she has 
shown pragmatism and a willingness 
to change her mind when presented 
with compelling evidence, says Paul 
Nightingale, deputy director of the 
Science Policy Research Unit at the 
University of Sussex. 

“She has been an example of good 
practice in gathering evidence, and also of 
explaining her decisions when they have not 
gone with the scientific advice,” says Sarah 
Main, director of the London-based Campaign 
for Science and Engineering. 

May’s past decisions do not necessarily give 
a good indication of how she will act as prime 
minister, Flanagan says. As a home secretary 
positioning herself to be the next prime min- 
ister, May had to seem tough on issues such as 
immigration and security, he says. 

Central to scientists’ concerns are the terms 
of the Brexit negotiation. In particular, they 
want reassurance that they will still be able to 
access EU science funding and to easily hire 
people from throughout the bloc. “We need 
to make sure people understand that, perhaps 
unintentionally, the UK research community 
does have a lot to lose,’ says Main. 

The community is anxious for the govern- 
ment to commit to shielding them from the 
possible impacts of Brexit. “The challenge 
would be for the new government and the new 
prime minister to set out clearly what is their 
ambition for science and research to play a part 
in the UK future,’ Main says. “We would like to 
see that clearly stated” m 


Additional reporting by Richard Van Noorden 
and Elizabeth Gibney. 
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First Greek science agency 
is rare source of joy 


European Investment Bank provides surprise loan to halt startling brain drain. 


BY ALISON ABBOTT 


massive loan from the European 
Atte investment bank gives Greek 
researchers their first ray of hope since 
the debt crisis hit six years ago: a government- 
backed plan to create a Greek research agency. 

Dubbed the Hellenic Foundation for 
Research and Innovation (HFRI), it will issue 
regular calls for basic research proposals, fill 
requests for research equipment and award 
fellowships for young scientists. 

Its funder, the European Investment Bank 
(EIB) in Luxembourg, generally finances pro- 
jects that contribute to growth and employ- 
ment, including many green-energy and 
water-security programmes. So the decision 
to support Greek research came asa surprise to 
some. “At first we hadn't believed that the EIB 
would pay for blue-skies research,’ says Greek 
research minister Costas Fotakis, a former 
laser physicist. 

The EIB says that the HFRI is crucial to halt- 
ing Greece's startling brain drain: graduates 
and postgraduates have left Greece in droves 
since the crisis began. 

On 15 July, the bank approved the 
€180-million (US$200-million) loan, which the 
Greek government will top up with €60 million. 
The total will support the HFRI during its first 
two-and-a-half years. The EIB rarely loans more 
than 50% of the cost ofa project, but the 28 EU 
member states voted unanimously to make an 
exception, says EIB spokesman Richard Willis. 

“Given the key issues around fundamental 
research in Greece, 75% was seen as justified,” 
he says. “It is about competitiveness, forward- 
thinking and the future.” 

The EIB does not make the detailed condi- 
tions of its loans public, but Willis says that the 
terms for the HFRI are very attractive and that 
Greece has a full 15 years to repay the money. 
If the HFRI is successful, the bank will consider 
additional loans in the future, he adds. 

A parliamentary bill to establish the agency 
is currently being discussed. It proposes that 
the HFRI operate along the lines of the US 
National Science Foundation and the DFG, 
Germany’s main funding agency — and with- 
out direct political influence. 

“For the first time we'll have some regularity 
in research funding in Greece,” says Nektarios 
Tavernarakis, director of the FORTH Institute of 


a Se 


Senn 


Blue-skies research in Greece gets a boost after six years of economic strain. 


Molecular Biology and Biotechnology (IMBB) 
in Heraklion, Crete. “It will definitely help make 
Greece a more attractive place to do research” 

Tavernarakis, who in April won his second 
grant from the prestigious European Research 
Council, knows from bitter experience how 
hard it is to persuade ambitious young scien- 
tists to join even a high-performing research 
institute like his own. When the debt crisis hit 
a peak last July, triggering bank closures, the 
resulting controls on the flow of capital out of 
the country frightened off candidates who had 
previously been prepared to work at the IMBB, 
he says. “And that was quite understandable” 

Fotakis says that the HFRI is a key part of 
government strategy to increase the country’s 
potential. He blames the brain drain on the 
austerity measures imposed by the European 
Commission, the European Central Bank and 
the International Monetary Fund in exchange 
for keeping the country from bankruptcy. “They 
have lowered wages and pensions, and made all 
aspects of research difficult,’ he says. 

Because parliament is unlikely to pass the 
HFRI bill before September, Fotakis plans to 
launch calls for some PhD and postdoctoral 
grants immediately. Even before the crisis, 
Greece has been one of the lowest investors in 
research in the EU. But once the crisis began, 


calls for research proposals, which were always 
modest and irregular, nearly disappeared. 
Scientists had to survive on international fund- 
ing sources, particularly those from the EU. 

The country still manages to host several 
internationally competitive research centres, 
but Fotakis says that even this is not sustain- 
able. “Scientific output is obviously not going 
to be sustained for ever when the brain drain 
is so intense.” 

The initial total of €240 million earmarked for 
the agency will support fundamental research, 
but the HFRI will eventually support a second 
stream for translational research. Beyond 2018, 
Fotakis expects the government to support the 
fundamental research stream with an annual 
€20 million. “And now that the EIB has demon- 
strated its confidence in Greek science with this 
loan, it'll be much easier to attract other sources 
of financing to the HERI’ he says. 

Greece fixed a few more bugbears when it 
passed a general law on research in May — for 
example, researchers will now be able to top up 
salaries for themselves and their groups if they 
bring in outside grants. 

No one imagines that the problems of Greek 
science are now solved. “Still? says Tavernara- 
kis, “these measures show things are at least 
going in the right direction” = 
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A Filipino ship and a Chinese coast-guard vessel pass each other in the South China Sea. 


CONSERVATION 


Ecosystem fear in 
South China Sea 


Court decision escalates tensions in ecologically sensitive 
region, but may also push nations to cooperate. 


BY DAVID CYRANOSKI 


n international tribunal has ruled 
Av China’s territorial claims in the 

South China Sea — and ecologists are 
worried. They warn that the decision, which 
China has vowed to ignore, could escalate 
tensions between China and its neighbours, 
leading to increased competition for fish and 
a subsequent collapse in stocks. 

“Now catastrophe looks much closer than 
before,” says John McManus, a marine ecolo- 
gist at the University of Miami in Florida. “The 
Hague decision stirred the pot.” 

But McManus and others also note that the 
ruling, by the Permanent Court of Arbitration 
in The Hague, may nudge China to cooper- 
ate more with regional rivals on scientific and 
environmental issues. 

The Philippines brought the case in 2013 
following China’s occupation of a reef, called 
Scarborough Shoal, that both countries claim. 
On 12 July, the court upheld Philippine alle- 
gations that China had unlawfully restricted 
access to its fisherman there. And it rebuked 
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China for its broader territorial claims within 
the ‘nine-dash line’ that envelops most of the 
South China Sea (see ‘Disputed waters’). 

China argues that certain geological fixtures 
in the sea are islands that belong to its sover- 
eign land and that the waters that surround 
them are within its ‘exclusive economic zone. 
The court found that the fixtures are merely 
rocks or “low-tide elevations”. 

The tribunal also said that China had vio- 
lated the United Nations Convention on the 
Law of the Sea by constructing artificial islands 
that damaged coral reefs, and by failing to stop 
its fishers from snatching protected species. 

The judgement is legally binding, but China, 
which views the South China Sea as increas- 
ingly important strategically, looks likely to 
ignore it. “China neither accepts nor recog- 
nizes it,” says a statement from the foreign 
affairs ministry. The region contains more 
than 1 billion tonnes of oil reserves and is an 
important nexus connecting the Pacific and 
Indian oceans. “No one will force China from 
those islands if they don’t want to go,” says Tim 
Johnston, Asia program director for the 


Brussels-based International Crisis Group. 
Days after the ruling, China cordoned off a 
swathe of the area for military exercises. 

Military escalation could follow. For 
instance, the tribunal found that Mischief Reef 
is within the Philippines’ exclusive economic 
zone but China has built a military base there, 
and, since the ruling, landed an aircraft there. 

Ecologists will be watching out for an esca- 
lation in fishing, in particular at the Scarbor- 
ough Shoal. Fish densities and catch rates in 
the South China Sea have plummeted in past 
decades. McManus’ research suggests that the 
reef sits at the centre of a crucial region from 
which many coastal fishing stocks are replen- 
ished. He fears that the ruling — which found 
that China and the Philippines share fishing 
rights there — will lead both to increase their 
fishing activity. “One will try to get there before 
the other one,’ says McManus. “When we have 
a present fisheries crisis and a looming fish- 
eries catastrophe, you shouldnt go to one of 
the most important places for fisheries and 
destroy it?” Such a rush could affect hundreds 
of species, he says, and permanently reduce the 
numbers of some, including sea turtles, sharks 
and giant clams. 

Another problem for conservationists is the 
destruction of coral reefs. McManus estimates 
that 162 square kilometres of reef has been 
destroyed, almost all of it by the Chinese in the 
past few decades. His calculations, which he 
presented at the South China Sea Conference 
on 12 July, suggest that Chinese cutter boats 
that hunt endangered giant clams account for 
104 km? of the damage, with 55 km’ a result of 
island building by dredging sediment from the 
ocean floor and dumping it on the reefs. 

A statement by China's ministry of foreign 
affairs says that island building is carried out 
“based on thorough studies and scientific 
proof” and that “the impact on the ecological 
system of coral reefs is limited”. But scientists 
have long said that the island building has to 
stop. Kwang-Tsao Shao, a marine-biodiversity 
expert at Taiwan's Academia Sinica in Taipei, 


DISPUTED WATERS 


An international tribunal has ruled that China’s 
territorial claims based on the ‘nine-dash line’ 
in the South China Sea are invalid. 
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says that at meetings that include his mainland 
peers, there is consensus from ecologists on 
both sides of the strait that the region should 
be set aside as a marine protected area. 
Scientists in the region do already collabo- 
rate: Zhifei Liu at Tongji University in Shanghai, 
China, leads a UNESCO research project 
which brings together scientists from China, 
the Philippines and other countries to study 
how sediment is deposited and transported 
in the South China Sea. Liu says that this and 
other China-funded projects will be unaffected 


by the ruling, although he worries that 
Philippine collaborators might now drop out. 

But scientific interest and environmental 
objectives in the area could strengthen diplo- 
macy, by giving adversaries a reason to sit down 
and iron out priorities of mutual interest. The 
tribunal ruling may be what is needed to spark 
such dialogue, says Johnston. China — although 
bullish in its responses so far and unlikely to 
give up its claims — will want to at least look 
as though it is acting fairly and for the com- 
mon good. “We are hoping this may tip the 
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balance and persuade China that constructive 
negotiation is the way forward,’ he says. 

Since the 1990s, McManus has advocated 
for a jointly run marine ‘peace park in the 
region. “It’s the only way to avert a fisheries 
collapse. It will have a good effect on efforts 
to avert military action,’ he says. The tribunal 
decision makes him slightly more optimistic. 
He now gives the parka 1 in 10 chance of hap- 
pening rather than 1 in 100. “If China can just 
sit there thinking ‘this is all ours, we wont get 
anywhere,’ he says. = 


EPIDEMIOLOGY 


Social cycle aids HIV spread 


Genetics study shows older men and young women drive South Africa’s epidemic. 


BY AMY MAXMEN 


is no secret in South Africa. The name 

‘blesser’ is commonly used to describe a 
man who may at first pay for a teenager's bus 
fare to high school, then buy school supplies 
she cannot afford, and perhaps lunch at a 
decent café. Over time, the adolescent sleeps 
with her provider. 

A genetic analysis now suggests how this 
social phenomenon plays into the cycle of 
HIV transmission in the country, which has 
the world’s largest HIV epidemic. By analysing 
the similarity of viral genetic sequences from 
nearly 1,600 people with HIV in one com- 
munity in KwaZulu-Natal, the study shows 
that adolescent girls and women in their early 
20s tend to pick up the virus from men aged 
around 30. When the women grow older, they 
go on to infect their long-term partners, who 
in turn may pass the virus on through affairs 
with younger women. 

“This is the engine driving high rates of 
HIV,’ says epidemiologist Salim Abdool 
Karim, senior author of the unpublished 
study and director of the Centre for the AIDS 
Program of Research in South Africa 
(CAPRISA). He presents the work this week at 
the International AIDS Conference in Durban. 

Karim thinks that the study adds to grow- 
ing evidence that HIV-free young women in 
regions with very high rates of HIV should 
be encouraged to take antiretroviral medica- 
tions regularly to prevent infection. The World 
Health Organization also recommends that 
people at substantial risk of HIV be offered 
what is known as PrEP (pre-exposure proph- 
ylaxis). But, owing in part to disappointing 
results in clinical trials, the South African 
government has not yet recommended PrEP 
for young women. 


\ ex between young women and older men 


Karim’s study also shows the importance 
of making broader social changes, adds 
Michel Sidibé, executive director of the Joint 
United Nations Programme on HIV/AIDS 
(UNAIDS). In parts of South Africa, 8 times 
as many teenage girls have HIV as do teen- 
age boys, and in 


some communities “This is the 

in KwaZulu-Natal, engine driving 
a 15-year-old girl high rates of 
has an 80% risk of HIV.” 


getting HIV in her 

lifetime. “Something 

that underlies the study is how common it is 
for older men to have sex with young girls. Pills 
are useful, but how can we break this silence 
around the lack of enforcement of laws that 
protect young women? How can we invest in 
the capacity of people to claim their rights and 
reduce this kind of violence?” 


VICIOUS CYCLE 
Researchers have long known of the high 
burden of HIV infections in young South 
African women, and that they get infected 
by older men, says Thomas Quinn, an epi- 
demiologist at Johns Hopkins Bloomberg 
School of Public Health in Baltimore, Mary- 
land, who was not involved in the study. But, 
he says, “it is very exciting to use molecular 
genetic information to actually show how 
the virus spreads among people” — and to 
pinpoint the ages of women and men at key 
points in the cycle of HIV transmission. 
Karim has tracked HIV genetic data for the 
past couple of years, and his work has already 
affected how governments and international 
organizations tackle the virus. He has been 
sharing results regularly with Deborah Birx, 
the US Global AIDS Coordinator who over- 
sees the US President’s Emergency Plan for 
AIDS Relief (PEPFAR), the largest federal 


funder of HIV-prevention research. 

In 2014, PEPFAR launched an initiative 
called DREAMS to protect young women from 
HIV, in response to the finding that more than 
1,000 women aged 16-24 become infected 
every day in southern and eastern Africa. 
When Birx heard details of Karim’s results last 
year, she convinced PEPFAR to target par- 
ticular demographic groups. For example, she 
says, half of the recipients of the US$85-million 
DREAMS Innovation Challenge, announced 
on 18 July, are 15- to 19-year-old girls. 

Despite the results, health officials remain 
reluctant to recommend PrEP for adolescent 
girls and young women. In June, the South 
African government endorsed PrEP for 
sex workers — but not for other high-risk 
groups, such as gay men and young women. 
One reason for its hesitancy is that the treat- 
ments have not proved effective in clinical 
trials, and blood tests have suggested that is 
because women in these trials did not consist- 
ently take their medications (either daily pills 
or a vaginal gel). 

But at the conference, the CAPRISA team 
announced another as-yet-unpublished finding 
that suggests a woman's vaginal microbiome 
may be in part responsible for the inefficacy 
of PrEP. A study of women who used the gel 
tenofovir to prevent HIV infection suggests 
that it was less effective in those who had 
Gardnerella vaginalis bacteria in their vaginal 
lining. The researchers found that G. vaginalis 
absorbed the PrEP drugs, reducing the amount 
of medication in the blood. 

This hints that more women may actu- 
ally have taken their medications than the 
disappointing blood tests from earlier trials 
suggested, Karim says — and also that treat- 
ment for an imbalance in vaginal bacteria 
might help PrEP to work better in some young 
women. @ 
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Algae speed 
up Greenland 
ice melt 


Project probes how 
microbes affect ice sheets. 


BY ALEXANDRA WITZE 


esearchers are fanning out across 
Re Greenland ice sheet this month 

to explore a crucial, but overlooked, 
influence on its future: red, green and 
brown algal blooms. These darken the ice, 
causing it to absorb more sunlight and melt 
faster. 

The £3-million (US$4-million) Black 
and Bloom project is the first to systemati- 
cally measure how algae are changing the 
amount of sunlight that Greenland’s ice sheet 
bounces back into space. “We want to know 
how much of the darkness is due to microbes 
and how much to other factors” such as soot 
or mineral dust, says principal investiga- 
tor Martyn Tranter, a biogeochemist at the 
University of Bristol, UK. 

The heart of the study is a region 
measuring half a kilometre long on each 
side, near Kangerlussuaq, Greenland. The 
researchers will spend six weeks collecting 
samples of black carbon and microorgan- 
isms while measuring incoming sunlight 
and reflectivity. The project will continue 
for two more summers, exploring different 
parts of the ice sheet. 

“They're extremely lazy algae — they 
sleep for nine months and then wake up 
and have a party,’ says team member Liane 
Benning, a biogeochemist at both the 
University of Leeds, UK, and the German 
Research Centre for Geosciences in Pots- 
dam. The blooms create vast, colourful 
fields of ‘watermelon snow. 

Last month, Benning and her team 
reported that algal blooms across the Arc- 
tic could reduce reflectivity by 13% over 
the melting season (S. Lutz et al. Nature 
Commun. 7, 11968; 2016). Their results 
should enable climate modellers to improve 
estimates of how the ice sheet — which 
holds enough water to raise sea levels by 
7 metres — is likely to melt in coming dec- 
ades. The past several years have seen record 
temperatures and melting across Greenland. 

Black and Bloom will provide “a one-of-a- 
kind data set” to help researchers understand 
Greenland’ future, says Marco Tedesco, a 
geophysicist at the Lamont-Doherty Earth 
Observatory in Palisades, New York. = 
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People line up to buy groceries outside a supermarket in Caracas. 


Q&A Claudio Bifano 
Venezuelan science 
under siege 


The political and economic crisis in Venezuela continues to worsen. As the price of oil — the 
country’s major export — has fallen, Venezuela has struggled to pay for imported goods while 


maintaining socialist economic policies put in place by former president Hugo Chavez. Claudio 
Bifano, a chemist at the Central University of Venezuela in Caracas, continues to teach despite 
rolling blackouts, food lines and increasing violence. “This is not a good time for the country, but 
we have to keep working,” he says. “We have to keep living.” Bifano, who is also the president of 
the Latin American Academy of Sciences, spoke to Nature about how Venezuelan scientists are 


coping with the turmoil. 


What is daily life like for Venezuelans? 

There is a serious scarcity of food and medi- 
cines, and there is high inflation, so the little 
that people can buy is extremely expensive. 
We depend on what the government imports, 
and as the credit that the government uses for 
imports is shrinking, it is more difficult to 
import. This is an economic model that has 
proven absolutely inefficient. 


How are the food shortages affecting society? 

We're already seeing a malnutrition problem. 
Many schools can no longer serve lunches, and 
the schools that still provide lunch, in a couple 


of states, as far as I know, serve 40% of what 
they used to provide. In a few years, we'll see 
the consequences of children and youth who 
were malnourished and had poor health care. 
Those problems will affect the country’s devel- 
opment. 


How are universities weathering the crisis? 

Universities are becoming degraded because 
of incomprehensible policies. Of the budget 
that autonomous universities receive from the 
state, 95% goes to pay personnel and only 5% 
is left for infrastructure, graduate programmes, 
research, outreach and everything else that 
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universities should do. And the salaries are 
ridiculous. 

I am a tenured professor and have been 
working at the university for 50 years. I have 
a PhD from the University of California [San 
Diego]. ’'mas high on the scale as I can go. My 
salary is 40,000 bolivares a month. At the real 
exchange rate, the rate at which I have to buy 
things in stores, I earn $40. 

Asa result, 700 professors, of a total of about 
3,000, have left the Central University. Four 
hundred have left Simon Bolivar University [in 
Sartenejas], which is a relatively small university. 
And although the Metropolitan University [in 
Caracas] is a private university, about 10% of its 
personnel have left. Why? Simply because you 
can't live on the salary that the university pays. 


Where are these people going? 

In the area of science and technology, many go 
to the United States, others to European coun- 
tries. A person with good academic training, a 
researcher or experienced professor, can get a 
job at an Ecuadorian university or go to Brazil, 
which has a policy to attract outside research- 
ers to strengthen some of its universities. 


Do research opportunities still exist in 
Venezuela? 

Fewer and fewer. With the degrading of the 
universities, laboratories are eliminated, 


resources are lost. Libraries don't have current 
journals. Laboratories can’t buy the reagents 
they need, because it’s expensive to do research 
that’s competitive internationally. 

The government says there are 16,000 
researchers here, and that it invests 2.6% of 
gross domestic prod- 
uct in science and 


“ ° 

The flight of technology, but the 
JOuNs; well- number of publica- 
trained people tions has decreased 
from Venezuela in recent years by 
is huge. practically 40%, and 


the flight of young, 
well-trained people from Venezuela is huge. 
In science and technology, it’s estimated that 
1,500 researchers have left the country. 


What future do you foresee? 
Something has to change. There are endless 
lines of people at markets, trying to buy food. 
The government controls the prices of some 
items and you can only buy them at certain 
stores. So when they say there are baby nappies 
or milk, for example, the lines are unending. 
There has also been an increase in violence. 
Around 6 p.m., when it starts to get dark, I go 
home, because I don’t feel safe on campus. 
About a month ago, there was a lynching in the 
Central University. A boy tried to snatch a girl’s 
purse, a friend came to her aid and there was a 
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fight. The assailant pulled out a pistol and shot 
the other boy, but fortunately only grazed his 
head. Then the assailant hid in the university, 
but other kids saw him, grabbed him and beat 
him to death, inside the university. 


How are students dealing with the crisis? 
They go to class, and I tell them, “Try to learn 
everything you can from your professors. Get as 
much education as you can.” But when you ask, 
“How many of you want to practise your profes- 
sion here in the country?” out of 10, two raise 
their hands. They're more material for export. 
It will take a long time to recover from this — 
at least two generations. I don't think I'll see it. m 


INTERVIEW BY BARBARA FRASER 


This interview has been edited for length and clarity. 


CORRECTION 

The article ‘Interference puts satellite data at 
risk’ (Nature 535, 208-209; 2016) wrongly 
stated that William Mahoney would lead a 
panel on spectrum-sharing at an American 
Meteorological Society meeting later 

this month. Jonathan Porter is the panel 
chairman. The story also did not make it 
clear that Ligado Networks is a satellite- 
communications company. 
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NYSICS.ON TWO WHEELS 


Jim Papadopoulos has spent a lifetime pondering bicycles in 
motion. His work, once nearly lost, has found fresh momentum. 


BY BRENDAN BORRELL 
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even bikes lean against the wall of Jim 

Papadopoulos’s basement in Boston, 

Massachusetts. Their paint is scratched, 

their tyres flat. The handmade frame 
that he got as a wedding present is coated in 
fine dust. “I got rid of most of my research 
bikes when I moved, he says. The bicycles that 
he kept are those that mean something to him. 
“These are the ones I rode.” 

Papadopoulos, who is 62, has spent much of 
his life fascinated by bikes, often to the exclu- 
sion of everything else. He competed in amateur 
races while a teenager and at university, but his 
obsession ran deeper. He could never ride a bike 
without pondering the mathematical myster- 
ies that it contained. Chief among them: What 
unseen forces allow a rider to balance while 
pedalling? Why must one initially steer right in 
order to lean and turn left? And how does a bike 
stabilize itself when propelled without a rider? 

He studied these questions intensely as a 
young engineer at Cornell University in Ithaca, 
New York. But he failed to publish most of his 
ideas — and eventually drifted out of aca- 
demia. By the late 1990s, he was working for a 
company that makes the machines that manu- 
facture toilet paper. “In the end, if no one ever 
finds your work, then it was pointless,” he says. 

But then someone did find his work. In 
2003, his old friend and collaborator from 
Cornell, engineer Andy Ruina, called him 
up. A scientist from the Netherlands, Arend 
Schwab, had come to his lab to resurrect the 
team’s research on bicycle stability. 

“Jim, you need to be a part of this,” Ruina 
told him. 


TWO WHEELS GOOD 
Together, the researchers went on to crack a 
century-old debate about what allows a bicycle 
without a rider to balance itself, publishing in 
Proceedings of the Royal Society’ and Science’. 
They have sought to inject a new level of science 
into the US$50-billion global cycling indus- 
try, one that has relied more on intuition and 
experience than on hard mathematics. Their 
findings could spur some much needed inno- 
vation — perhaps helping designers to create a 
new generation of pedal and electric bikes that 
are more stable and safer to ride. Insights from 
bicycles also have the potential to transfer to 
other fields, such as prosthetics and robotics. 

“Everybody knows how to ride a bike, but 
nobody knows how we ride bikes,’ says Mont 
Hubbard, an engineer who studies sports 
mechanics at the University of California, 
Davis. “The study of bicycles is interesting 
from a purely intellectual point of view, but it 
also has practical implications because of their 
ability to get people around.” 

For a mechanician — that fusty breed of 
engineer whose subject is defined by Newton's 
three laws of motion — the conundrums of the 


Jim Papadopoulos sees a future with radically 
new bicycle designs. 


bicycle hold a special allure. “We are all stuck 
in the nineteenth century, when there wasn't 
such a difference between math and physics 
and engineering,” says Ruina. Bicycles, he says, 
are “a math problem that happens to relate to 
something you can see”. 

The first patents for the velocipede, a two- 
wheeled precursor to the bike, date to 1818. 
Bikes evolved by trial and error, and by the 
early twentieth century they looked much as 
they do today. But very few people had thought 
about how — and why — they work. William 
Rankine, a Scottish engineer who had analysed 
the steam engine, was the first to remark, in 
1869, on the phenomenon of ‘countersteering, 
whereby the rider can steer to the left only by 
first briefly torquing the handlebars to the right, 
allowing the bike to fall into a leftward lean. 

The link between leaning and steering gives 
rise to the bicycle’s most curious feature: the 
way that it can balance while coasting on its 
own. Give a riderless bike a shove and it may 
wend and wobble, but it will usually recover its 
forward trajectory. In 1899, English mathema- 
tician Francis Whipple derived one of the earli- 
est and most enduring mathematical models of 
a bicycle, which could be used to explore this 
self-stability. Whipple modelled the bicycle as 
four rigid objects — two wheels, a frame with 
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A bicycle’s front wheel can act as a caster 
because the point at which the wheel contacts 
the ground typically sits anywhere from 5 cen- 
timetres to 10 centimetres behind the steering 
axis (see “What keeps a riderless bike upright?’). 
This distance is known as the trail. Jones dis- 
covered that a bike with too much trail was so 
stable that it was awkward to ride, whereas one 
with negative trail was a death trap and would 
send you tumbling the moment you released the 
handlebars. 

When a bicycle starts to topple, he con- 
cluded, the caster effect steers the front end 
back under the falling weight, keeping the 
bicycle upright. To Jones, the caster trail was 
the sole explanation for a bike’s self-stability. 
In his memoir, published 40 years later, he 
counted the observation as one of his great 
accomplishments. “I am now hailed as the 
father of modern bicycle theory,” he declared. 


GEARING UP 

That article would make an impression on Jim 
Papadopoulos, then a teenager in Corvallis, 
Oregon, with a gift for numbers and a home 
life in tatters. In 1967, his father Michael, an 
applied mathematician, had moved his wife 
and four children from England to the United 
States to start a job at Oregon State University. 


"EVERYBODY KNOWS HOW TO RIDE A BIKE, 
BUT NO ONE KNOWS HOW WE RIDE BIKES.” 


the rider and the front fork with handlebars — 
all connected by two axles and a hinge that are 
acted upon by gravity. 

Plugging the measurements of a particular 
bicycle into the model revealed its path dur- 
ing motion, like a frame-by-frame animation. 
An engineer could then use a technique called 
eigenvalue analysis to investigate the stability of 
the bicycle as one might do with an aeroplane 
design. In 1910, relying on such an analysis, the 
mathematicians Felix Klein and Fritz Noether 
along with the theoretical physicist Arnold 
Sommerfeld focused on the contribution of the 
gyroscopic effect — the tendency ofa spinning 
wheel to resist tilting. Push a bicycle over to the 
left and the rapidly spinning front wheel will 
turn left, potentially keeping the bicycle upright. 

In April 1970, chemist and popular-science 
writer David Jones demolished this theory in an 
article for Physics Today’ in which he described 
riding a series of theoretically unrideable bikes. 
One bike that Jones built had a counter-rotating 
wheel on its front end that would effectively 
cancel out the gyroscopic effect. But he had lit- 
tle problem riding it hands-free. 

This discovery sent him hunting for another 
force that could be at play. He compared a bike's 
front wheel to the casters on a shopping trolley, 
which turn to follow the direction of motion. 
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But Michael Papadopoulos was denied tenure 
after protesting against the Vietnam War, set- 
ting offa decade-long legal battle with the uni- 
versity that left him out ofa job and the family 
scouring rubbish bins for scraps. Jim’s mother 
killed herself in the early 1970s. “Just as I was 
opening my eyes to the world and deciding 
who I was,’ Papadopoulos says, “my family 
was falling apart” 

He found solace in bikes. He pedalled his 
Peugeot AO8 around town and grew his hair to 
his shoulders. He stopped going to classes, and 
his grades took a tumble. At 17, he dropped out 
of school and left home. But before he aban- 
doned his studies, a teacher gave him the Jones 
article. 

Papadopoulos found it captivating but con- 
fusing. “I’ve got to learn this stuff, he thought. 
He spent the summer bumming around Berke- 
ley, California, reading George Arfken’s text- 
book Mathematical Methods for Physicists in his 
spare time. Then, he worked at a plywood mill 
in Eugene, Oregon, earning enough money to 
buy the legendary Schwinn Paramount that he 
raced every weekend. In 1973, he worked for 
the frame builder Harry Quinn in Liverpool, 
UK, but he was terrible at it and Quinn asked 
him to leave. 

Papadopoulos returned to Oregon, took a 
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WHAT KEEPS A RIDERLESS BIKE UPRIGHT? 


Scientists have grappled with this question for decades. 
How does a bike moving forward without a rider stay 
upright? Even when struck from the side, it will correct 
its course and regain stability. 


THE WEIGHT DISTRIBUTION 
Bicycle researcher Jim 
Papadopoulos found these 
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few college classes and, in 1975, started under- 
graduate studies in mechanical engineering 
at the Massachusetts Institute of Technology 
(MIT) in Cambridge. He did well. Oil company 
Exxon later supported him as he studied for a 
PhD in fracture mechanics. Papadopoulos’s 
adviser, Michael Cleary, was optimistic about 
his prospects as an academic. “I think Jim will 
become a university professor — and we cer- 
tainly hope it’s going to be here at MIT,” he told 
a writer from Exxon’s in-house magazine. 

Papadopoulos had other ideas. He had 
been studying Whipple’s model and Jones’s 
article, and wrote to bike companies offering 
his services as an engineer. He didn’t receive 
a single response. Instead, Cleary helped him 
to get a position at the US Geological Survey 
in Menlo Park, California, where he first met 
Andy Ruina. 

The two became fast friends. When Ruina 
got a job at Cornell, he hired Papadopoulos as 
a postdoc. “We talked about bikes all the time, 
but I didn’t realize he wanted to make a serious 
thing about it,” Ruina says. 

Papadopoulos convinced Ruina that bicycle 
companies — like oil companies — might be 
interested in supporting academic research. So 
he started fund-raising, reaching out to bike 
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THE GYROSCOPIC EFFECT 
A spinning wheel will resist 
falling over and transfer 
tilting force into a turn. This 
could help to right a bike. 
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THE CASTER TRAIL 

A bicycle’s front-wheel steering axis sits slightly 
ahead of the point at which the wheel touches the 
ground, creating a ‘trail’ like that of an office-chair 
caster. This means the wheel will turn in the direction 
the bike is travelling (or falling, as the case may be). 
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makers. For $5,000, they could be benefactors 
of the Cornell Bicycle Research Project, an 
ambitious effort that would investigate every- 
thing from the strength of wheels to brake 
failure in the rain. 

Papadopoulos’s first goal was to finally 
understand what makes one bicycle more 
stable than another. He sat in his office and 
scrutinized 30 published attempts at writing 
the equations of motion for a bicycle. He was 
appalled by the “bad science’, he says. The 
equations were the first step towards connect- 
ing the geometry of a bicycle frame with how it 
handled, but each new model made little or no 
reference to earlier work, many were riddled 
with errors and they were difficult to compare. 
He needed to start from scratch. 

After a year of work, he had what he believed 
to be the definitive set of equations in hand. 
Now it was time for them to talk back to him. 
“T was sitting for hours at a time, staring at the 
equations and trying to figure out what they 
implied,” he says. 

He first rewrote the bicycle equations in 
terms of the caster trail, the crucial variable 
that Jones had championed. He expected to 
find that if the trail was negative, the bicy- 
cle would be unstable, but his calculations 
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suggested otherwise. In a report that he pre- 
pared at the time, he sketched a bizarre bicycle 
with a weight jutting out in front of the handle- 
bars. “A sufficiently forward [centre of mass] 
can compensate for a slightly negative trail,” 
he wrote. No single variable, it seemed, could 
account for self-stability. 

This discovery meant that there was no 
simple rule-of-thumb that could guarantee 
that a bike is easy to ride. Trail could be useful. 
Gyroscopic effects could be useful. Centre of 
mass could be useful. For Papadopoulos, this 
was revelatory. The earliest frame builders had 
simply stumbled on a design that felt OK, and 
had been riding around in circles in that nook 
of the bicycle universe. There were untested 
geometries out there that could transform bike 
design. 


THE CRASH 

After two years, Ruina could no longer support 
Papadopoulos. Apart from the bike manufac- 
turer Murray, the only industry donations the 
two ever got were from Dahon and Moulton, 
makers of small-wheeled bicycles — perhaps 
because the bikes’ unconventional designs could 
make them tricky to ride. Ruina joked that he 
should change the name to the “Folding Bicycle 
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Research Project”. It was gallows humour. 

And although Papadopoulos was mak- 
ing progress in the mathematics of bikes, the 
only article that he ever published as the first 
author was a book chapter’. “I find much more 
joy discovering the new and working out the 
details and, of course, it’s boring to write it 
up,” he says. Without money or publications, 
his time in bicycle research wound down. In 
1989, he put his bikes into a moving van and 
drove west to Illinois, where his then-wife had 
ajob. He endured a succession of teaching and 
industry jobs that he hated. In his spare time, 
he moderated the Bicycle Science e-mail list 
for bicycle-science nerds and helped to build a 
car that fitted into a few suitcases for the reality 
television show Junkyard Wars. 

In 2001, David Wilson, an MIT engineer and 
inventor of one of the first modern recumbent 
bicycles, invited Papadopoulos to co-author 
the third edition of the book Bicycling Science. 
Papadopoulos was overwhelmed by monetary 
debts and responsibilities. He failed to send 
Wilson the first chapter, and then stopped 
responding to e-mails altogether. Wilson felt 
betrayed. “He is a rather brilliant guy,’ Wilson 
says, but “he always had problems finishing 
anything”. 


BACK TO THE BIKE 

At Cornell, Ruina moved on. He applied the 
team’s insights about bicycles to a new arena: 
robots. If bicycles could demonstrate such 
elegant stability without a control system, 
he reasoned, it might be possible to design a 
stripped-down walking machine that achieves 
the same thing. In 1998, he worked with 
Martijn Wisse, a graduate student of Schwab’ at 
the Delft University of Technology in the Neth- 
erlands, to build a bipedal machine that could 
walk down a slight incline with no motor at all, 
storing energy in its swinging arms. Adding 
a few electronic motors generated an energy- 
efficient robot that could walk on level ground. 

In 2002, Schwab decided to spend his 
sabbatical with Ruina, and they started discuss- 
ing the old bicycle work. It was then that Ruina 
called Papadopoulos and paid for him to visit. 
“That was the first time I met the genius,” says 
Schwab. 

With more bicycles on the road than ever 
before, Schwab found it inconceivable that 
no one had published the correct set of bike 
equations, or applied it to bicycle design chal- 
lenges. Within a year, he and Jaap Meijaard, an 
engineer now at the University of Twente in 
the Netherlands, independently derived their 
own equations and found complete concord- 
ance with Papadopoulos’. They presented the 
definitive bicycle equations at an engineer- 
ing conference in South Korea, and the four 
collaborators published them jointly’. 

The challenge now was to prove that it was 
more than just a mathematical finding. Schwab 
and a student spent a year building a self-stable 
bike with a very small negative trail. Looking 


like the offspring of a razor scooter and a see- 
saw, it had a weight angled out in front of the 
front wheel and a counter-turning wheel to 
cancel out gyroscopic effects. In a video of it 
coasting, you can see it lean and veer to the 
right, but then recover on its own’. The experi- 
ment proved that Papadopoulos had been right 
about the complex interplay of factors that 
make a bicycle stable or unstable. 

Yet, after waiting for three decades for his 
discoveries to reach a wider audience, Papado- 
poulos can't help but feel deflated. “It did not 
change everything in the way that we imag- 
ined, he says. This year’s bike frames look 
much like last year’s. “Everyone is still in the 
box,’ he says. Nevertheless, other researchers 
have since been pulled into the groups orbit, 
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Today, Schwab has the kind of laboratory 
that Papadopoulos always dreamed of, and 
Papadopoulos is grateful to be able to collabo- 
rate. “It's the most beautiful thing you can imag- 
ine,’ he says. Schwab's other projects include a 
‘steer by wire’ bike, which allows him to sepa- 
rate steering movements from balancing ones, 
and a ‘steer assist’ bicycle, which stabilizes itself 
at slow speeds. He has also identified a rear- 
steered recumbent bike that shows self-stability, 
in part owing to an enlarged front wheel that 
boosts gyroscopic effects. The chief advantage 
of a rear-steered recumbent is that it would 
have a shorter chain than standard recumbents, 
which should lead to better energy transfer. 
“People have tried to build them before, but 
they were unrideable,’ Schwab says. 


“ONCE YOU HAVE A ROBOT BICYCLE, YOU 
CAN DO ALOT OF CRAZY EXPERIMENTS.” 


creating enough momentum to launch a 
Bicycle and Motorcycle Dynamics conference 
in 2010. It gathers together tinkerers from all 
over the world, some of whom have also built 
weird experimental bicycles to test design 
principles. 

One of the organizers of this year’s 
conference, engineer Jason Moore of the 
University of California, Davis, has sought to 
probe the link between a bicycle frame's geom- 
etry and an objective measure of handling — 
its ease of control’. The work was inspired by 
extensive military research on aircraft pilots. 
Moore created a model of human control by 
performing various manoeuvres on bikes kit- 
ted out with sensors to monitor his steering, 
lean and speed. To force himself to balance 
and ride using steering movements alone 
(rather than shifting his weight), he had to 
don a rigid upper-body harness that bound 
him to the bike. The research confirmed the 
long-standing assumption that more stable 
bikes handle better, and potentially gives frame 
builders a tool to optimize their designs. 

It also introduced a puzzle: the steering 
torque required was two or three times that 
predicted by the Whipple bicycle model’. This 
might have been caused by friction and flexing 
of the tyres, which are not part of the model, 
but no one is certain. For further tests, Moore 
and his colleagues have built a robotic bike 
that can balance itself. “Once you have a robot 
bicycle, you can do a lot of crazy experiments 
without having to put a human in danger,” 
he says. (One of his earlier handling experi- 
ments had him regaining his balance after a 
sideways blow from a wooden stick.) Unlike 
many other riderless-bike robots, it does not 
use internal gyroscopes to stay upright, but 
depends on steering alone. Moore has shipped 
it to Schwab for further study. 
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Papadopoulos, who now has a teaching 
position at Northeastern University in Bos- 
ton, is trying to get comfortable with academia 
once again. He's establishing collaborations, 
and testing out long-dormant ideas about 
why some bicycles wobble at high speed’. He 
believes he can eliminate speed wobble with a 
damper to soak up vibrations in the seat post. 
With his new colleagues and students, he is 
branching out into other types of question, 
not all them bike-related. 

Down in his basement, Papadopoulos 
opens the drawer of a tan filing cabinet and 
starts flipping through crinkled manila fold- 
ers marked with labels such as ‘tire pressure; 
‘biomechanics’ and ‘Cornell’. He pulls out 
a textbook. “Exercise physiology? I never 
really got into that one,” he says, tossing it 
aside. In the back of the drawer, he finds a 
thick folder of bicycle research ideas, marked 
‘Unfinished. 

Papadopoulos thinks for a second and then 
offers a correction: “Mostly unfinished” = 


Brendan Borrell is a journalist in New York 
City. 
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THE UNSUNG HEROES 


The soaring popularity of gene editing 
has made celebrities of the principal 
investigators who pioneered the 

field — but their graduate students and 
postdocs are often overlooked. 


BY HEIDI LEDFORD 


hen Blake Wiedenheft started 

studying microbes, his work 

was both remote and obscure. 

He spent his PhD sampling 

hot springs in Yellowstone 

National Park, then cre- 

ated artificial versions in the 

laboratory to study the microorganisms that 

lived in the inhospitable water. “We wanted to 

understand how life could survive in boiling 
acid,” he says. 

Over time, Wiedenheft became more inter- 
ested in how microbes fend off viruses. He read 
around, and came across a peculiar bacterial 
immune system called CRISPR. In 2007, he 
approached Jennifer Doudna, a molecular 
biologist at the University of California, Berke- 
ley, and found that she shared his interest. Join 
the lab, she said — and he did. Over the next 
five years, he studied the structure and bio- 
chemistry of CRISPR systems, landing a first- 
author publication in Nature’. 

Today, CRISPR is a household name for 
molecular biologists around the world. 
Researchers have eagerly co-opted the system 
to insert or delete DNA sequences in genomes 
across all kingdoms of life. CRISPR is being 
used to generate a new breed of genetically 
modified crops and may one day treat human 
genetic diseases. Doudna and other principal 
investigators involved in the seminal work 
have become scientific celebrities: they are pro- 
filed in major newspapers, star in documen- 
taries and are rumoured to be contenders for 
a Nobel prize. “When I came to the lab, I was 
the only person studying CRISPR,” Wieden- 
heft says. “When I left the lab, almost everyone 
was studying it” 

Wiedenheft, however, has hardly achieved 
the same fame as his mentor — and nor have the 
other students and postdocs who toiled at 
the bench to make CRISPR genome editing 
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a reality. They certainly reap benefits from 
their work: support and reflected glory from 
their supervisors, as well as expertise in a cov- 
eted technique. But some also face a difficult 
transition to becoming independent scien- 
tists as they try to establish themselves in a 
hypercompetitive field. 

For Wiedenheft, the key to survival has been 
seclusion. When he struck out from Doudna’s 
lab, he opted for a return to Montana State 
University in Bozeman, where he did his PhD, 
over an offer from a larger, better-known insti- 
tution. “At the end of the day, the opportuni- 
ties for solitude and being outdoors make me 
more creative and a better scientist,” he says. 
But like other young scientists who graduate 
from powerhouse labs, he can’t help but won- 
der how different life might have been if acco- 
lades in biomedical science were given to the 
first authors on a paper, rather than the last. 
Now and then, he admits, he doesn't feel quite 
appreciated enough. “Some days it matters, 
some days it doesn't” 


AN EDITED HISTORY 

The history of CRISPR-Cas9 gene editing 
has become a subject of fierce debate and a 
bitter, high-stakes patent battle. Researchers 
and institutes have been jostling aggressively 
to make sure that they are credited for their 
share of the work in everything from academic 
papers to news stories. “I get a lot of phone calls 
from lawyers about what I did and when,” 
Wiedenheft says. 

In January, Eric Lander, president of the 
Broad Institute of MIT and Harvard in Cam- 
bridge, Massachusetts, tossed into this mine- 
field a historical portrait called “The Heroes of 
CRISPR”. It was instantly controversial. Some 
said that it marginalized the contributions 
of certain researchers, and they questioned 
the decision to publish the article without a 


UF CRISPR 


conflict-of-interest statement noting that the 
Broad Institute is embroiled in a patent dis- 
pute that hinges on determining who invented 
CRISPR-Cas9 gene editing. 

But for George Church, a geneticist at Har- 
vard Medical School in Boston, Massachusetts, 
who is also a pioneer in the field, it was par- 
ticularly painful to see statements attributing 
key discoveries to him rather than his postdocs 
and graduate students. “Eric said my name too 
many times,’ Church says. 

Lander says that there was no intended 
slight in the ‘Heroes’ story. He was mindful that 
there were dozens of other co-authors on the 
key papers, “But I couldn't figure out how to 
collect and tell their stories within a nine-page 
article.” If anything, he adds, the article wid- 
ened the CRISPR spotlight: most discussion 
up to that point had focused on 3 major con- 
tributors to the field, whereas his piece featured 
17 major players and acknowledged that there 
were many others. 

Any lack of attention to CRISPR’s junior 
discoverers comes despite fervent advocacy 
on the part of their advisers. Junior investi- 
gators in the Church lab praise their leader’s 
unwavering support, along with the unique 
intellectual environment he has fostered in 
the lab. Doudna is a fierce champion of the 
scientists she has mentored. “It’s really impor- 
tant for junior investigators to get the credit 
they deserve,” she says. “They really drive the 
scientific enterprise.” What is more, academic 
papers often set out each author’s contribu- 
tion to the work. 

But those details often get lost simply because, 
broadly speaking, credit in science goes to the 
leader of the lab, as do any prizes that follow. 
“That's just how the system works, and I accept 
my role in this system,” says Martin Jinek, 
another Doudna lab alumnus. “But yeah, it’s 
something you cant help but think about” > 
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> Sometimes people may take note of the 
first author, but not in a meaningful way, says 
Rachel Haurwitz, a former Doudna gradu- 
ate student and now president of Caribou 
Biosciences in Berkeley, California. “They'll 
say ‘the 2012 Jinek paper’ but most people have 
no idea who Martin Jinek is,” she says. 

Jinek was co-first author on a seminal 
paper’ showing that the enzyme Cas9 can 
be programmed to target specific sequences 
of DNA using only a short strand of RNA — 
and he found that his life became defined by 
CRISPR. When he entered the job market, he 
couldn't even discuss the work in interviews 
because the patent had not been filed. Even so, 
he got an attractive offer from the University 
of Zurich in Switzerland, and has since built a 
lab there that focuses on the basic biology of 
CRISPR more than its applications. 

As interest in CRISPR-Cas9 gene editing 
grew, his schedule became packed: he now 
travels to talks two to three times a month. 
Although he appreciates the professional boost 
that the CRISPR frenzy has given him, he also 
struggles to find a balance between running his 
lab and other obligations. 

Haurwitz has faced obstacles, too. She spent 
her PhD characterizing the CRISPR-based 
microbial immune system and the structure 
of a CRISPR-associated enzyme called Cys4 
(ref. 4). In 2011, she co-founded Caribou 
along with Doudna and others to commer- 
cialize research tools based on CRISPR. The 
early days were tough, but Caribou has since 
formed partnerships with major industry play- 
ers, and the company announced in May that 
its latest round of fundraising had brought 
in US$30 million. Yet as the firm has grown, 
some investors have pushed to replace Haur- 
witz with a more seasoned leader. Doudna has 
quashed the idea. ” There’s no reason to replace 
her,” says Doudna. “She keeps showing that she 
has the talent to be successful” 


RIDING THE WAVE 

For many early career scientists, working 
in such a hot field has clear advantages. As a 
postdoc, bioengineer Prashant Mali helped 
to launch the CRISPR project in Church’s 
lab. He was a co-first author on the lab’s 2013 
paper’ demonstrating that CRISPR-Cas9 
could be used to edit the genome in human 
induced pluripotent stem cells. 

The discovery sent CRISPR excitement 
to fever pitch — a wave of enthusiasm that 
Mali rode into the job market later that year. 
“T definitely got a lot of endorsements,” he 
says. (There is, however, no mention of him 
in ‘Heroes of CRISPR — a sore point with 
Church.) Eventually settling at the Univer- 
sity of California, San Diego, Mali continues 
to study stem-cell development and develop 
CRISPR-based tools. He accepts the intensity 
of the field as a small price to pay. His lab is 
just 18 months old — too young to have been 
scooped yet, he says — but competition is 
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inevitable. “There will obviously be a lot of 
overlap of good ideas.” 

CRISPR threw open doors for Luhan Yang, 
the other first author on the 2013 Science 
paper from Church’s lab. Soon after the paper 
was published, the lab was contacted by sev- 
eral researchers who study organ transplanta- 
tion. They wanted to know whether genome 
editing could now be used to engineer 


P pig organs so that 
“Theidea they would be less 


that scientific likely to provoke an 
discovery immune response in 
involves just humans. Yang seized 
one. two or the idea with gusto, 
, ° says Church. 
three people 1s The pig genome 
sonineteenth-_— ishome to retroviral 
century. . DNA, and concern 


that those retrovi- 
ruses might become reactivated in a human 
host led many researchers to flee the field in the 
late 1990s. Yang reasoned that the retroviral 
sequences are so similar to one another that a 
single CRISPR-Cas9 experiment might knock 
out many of them at once. She and her three 
co-first authors now hold the world record® 
for the largest number of sequences targeted 
in a single CRISPR-Cas9 experiment: 62. And 
Yang is raising money to launch a company 
with Church called eGenesis, to further the 
work. “George always gave me the opportunity 
to establish my leadership,’ she says. 

Across the Charles River from the Church 
lab, graduate student Le Cong worked side-by- 
side, late into the night with his mentor, bio- 
engineer Feng Zhang at the Broad Institute, to 
develop CRISPR gene editing in mammalian 
cells. Zhang was himself a young investigator 
just launching a lab when Cong joined; Cong 
remembers opening the box containing the 
lab’s first centrifuge and sitting with Zhang at 
a computer googling ‘DNA-binding protein’ 
to look for new ways to edit genomes. The two 
became a tightly knit team. 

When they embarked on the CRISPR pro- 
ject, it seemed like along shot as Cong screened 
enzymes and reaction conditions, trying to find 
those that would work in human cells. 

But Cong was willing to take the risk. He and 
Zhang had previously pioneered the use of a 
different gene-editing system, called TALENs, 
in mammalian cells, and he reasoned that this 
early success would be enough to allow him 
to graduate if the CRISPR project failed. He 
never had to test that hypothesis: in 2013, Cong 
and his fellow graduate student Fei Ann Ran 
co-first authored a Science paper’ showing that 
the system works in mammalian cells — the 
paper was published simultaneously with that 
of Mali, Church and their team. 

At that point, Cong was advised that he 
could skip the postdoc and go straight to a 
faculty position. But he worried that doing 
so would limit him: he could be pigeonholed 
as ‘the CRISPR guy. “I felt uncomfortable 


about that,” he says. “I was not only looking 
to develop technology.” Instead, Cong opted 
for another postdoc; he is now embarking on 
a faculty job search, and plans to use his lab 
to study allergies and autoimmune disorders. 

Cong says that he feels no resentment at 
being largely excluded from the CRISPR media 
frenzy and attention centred on Zhang. “I do 
think I’ve been recognized,’ he says; Zhang 
has been generous in giving him credit within 
the scientific community, and has encouraged 
Cong to give talks in his stead. 

And Cong, like others interviewed for this 
story, is himself insistent about giving credit to 
others in the field. He sprinkles in references 
to work done in other labs, including some of 
the earliest microbiology work characterizing 
the CRISPR system. Wiedenheft says that’s 
characteristic of the CRISPR community. “It’s 
competitive, but it’s friendly” 

Outside that community, however, the acco- 
lades continue to be heaped on senior investi- 
gators. “We need to invent ways to expand the 
medals podium,” says Lander. “The idea that 
scientific discovery involves just one, two or 
three people is so nineteenth-century.” 

There are many more unsung heroes of 
CRISPR than this article could do justice to. 
One often overlooked group is headed by 
Virginijus Siksnys at Vilnius University in Lith- 
uania — where Giedrius Gasiunas began his 
PhD in 2007. He plugged away for years, tack- 
ling the biochemistry of CRISPR-Cas9 and, 
like Jinek, eventually came to the conclusion 
that the Cas9 enzyme could be programmed 
to cut isolated DNA at specific sites. 

In 2012, the lab sent a paper to Cell, where 
it was rejected without review. Gasiunas then 
submitted the paper to the Proceedings of the 
National Academy of Sciences and waited. 
A few months later, while his paper was still 
under review, the now-legendary Jinek paper 
appeared in Science. The two papers had key 
differences, but reached similar conclusions. 
Gasiunas had been scooped’. 

Seeing other scientists collect awards for 
CRISPR gene editing sometimes irks Gasiu- 
nas, now a postdoc in Siksnys’s lab. But the 
experience has not entirely soured him on the 
subject. Although he has since been scooped 
again, he finds it no longer stings as much as 
it did. “It’s a risky field, he says. “But I think if 
you want to achieve something great, you need 
to take risks.” = 


Heidi Ledford is a senior reporter for Nature 
based in London. 
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Wearable devices that track fitness are a rich source of real-time health data. 


Stop the privatization 
of health data 


Tech giants moving into health may widen inequalities and harm research, unless 
people can access and share their data, warn John T. Wilbanks and Eric J. Topol. 


ver the past year, technology titans 
() including Google, Apple, Microsoft 

and IBM have been hiring leaders in 
biomedical research to bolster their efforts to 
change medicine. 


In September 2015, Tom Insel announced 
that he would quit his position as head of 


the US National Institute of Mental Health 
to join Google Life Sciences (now Verily). 
Three months later, Michael McConnell 
took a leave of absence from directing 
major cardiovascular research programmes 
at California's Stanford University to join 
him. And last month, Stephen Friend tooka 


senior position with Apple. He is co-founder 
and former president of Sage Bionetworks, a 
non-profit organization that promotes open 
science and patient engagement in research 
(where one of us, J.T. W, works). 

In many ways, the migration of clinical 
scientists into technology corporations 
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> that are focused on gathering, analysing 
and storing information is long overdue. 
Because of the costs and difficulties of 
obtaining data about health and disease, 
scientists conducting clinical or population 
studies have rarely been able to track suf- 
ficient numbers of patients closely enough 
to make anything other than coarse predic- 
tions. Given such limitations, who wouldn't 
want access to Internet-scale, multidimen- 
sional health data; teams of engineers who 
can build sensors for data collection and 
algorithms for analysis; and the resources 
to conduct projects at scales and speeds 
unthinkable in the public sector? 

Yet there is a major downside to monoliths 
such as Google or smaller companies such 
as consumer-genetics firm 23andMe own- 
ing health data — or indeed, controlling the 
tools and methods used to match people's 
digital health profiles to specific services. 

Digital profiling in other contexts is 
already creating what has been termed a 
‘black box’ society. Online adverts are tai- 
lored to people's age, location, spending 
and browsing habits. Certain retail ser- 
vices have preferentially been made avail- 
able only to particular groups of people. 
And law enforcers are being given tools to 
help them make sentencing decisions that 
cannot be openly assessed (see go.nature. 
com/29umpul). This is all thanks to the 
deliberately hidden collection and manipu- 
lation of personal data. 

If undisclosed algorithmic decision- 
making starts to incorporate health data, the 
ability of black-box calculations to accen- 
tuate pre-existing biases in society could 
greatly increase. Crucially, if the citizens 
being profiled are not given their data and 
allowed to share the information with others, 
they will not know about incorrect or dis- 
criminatory health actions — much less be 


THE DIGITAL HEALTH RUSH 


Analysts predict that worldwide, the value of 
Internet-enabled devices used to collect, 
analyse and distribute health and fitness data 
will exceed US$163 billion within 4 years. 
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able to challenge them. And most researchers 
wont have access to such health data either, 
or to the insights gleaned from them. 


INFORMATION FLOW 

We believe that the influx of health experts 
into Silicon Valley could foreshadow a fun- 
damental shift in biomedical research and 
health care. 

In various countries, including the United 
Kingdom, Sweden, Norway and Estonia, 
sustained pushes by governments and civil 
society have made standardized electronic 
health records the norm. And in the United 
States, hopes are high that truly useful elec- 
tronic health records (that can be called up 
no matter which provider a patient goes to) 
will be rolled out within the next five years. 
Meanwhile, advances are being made in 
machine learning, and there is increased 
availability of digital health data outside 
the medical system. This type of informa- 
tion is much more amenable to machine- 
learning approaches than are conventional 
clinical observations. Furthermore, some of 
the technology corporations entering health 
have hundreds of billions of dollars in cash 
reserves. These factors combined could 
enable the new players to eclipse, not just 
join, the old ones. 

Until now, obtaining health data has 
generally depended on highly skilled 
professionals who record perhaps tens 
to hundreds of observations in a clinic or 
hospital ward once or twice a year, and on 
researchers who painstakingly extract the 
relevant information from hard-to-obtain, 
non-standardized medical records. Now, 
thanks to smartphone apps — such as those 
created using Apple’s software framework 
ResearchKit, launched last year — and 
wearable sensors that can detect gait, loca- 
tion, heart rate and even brain activity, 
analysts can draw on tens of thousands of 
real-time observations collected from tens 
of thousands of people every day, even every 
minute. 

People’s search behaviour contains 
health information too. In February 2015, 
Google partnered with the Mayo Clinic, a 
non-profit medical practice and research 
group based in Rochester, Minnesota, to 
curate health-related facts, such as the 
most common causes of low back pain. 
Google’s teams are plugging curated and 
verified health information into the search 
engine’s smart search algorithm, Knowl- 
edge Graph, so that users obtain informa- 
tion that is more relevant and supposedly 
more accurate whenever they type in 
their symptoms or condition. The service 
(which for many ailments may replace 
visits to the doctor) will only enhance 
Google’s — and only Google’s — ability to 
conduct an unprecedented level of infor- 
mation retrieval for health. By the tech 


giant’s own calculations, of the 40,000 or 
so searches that are made every second, 
2,000 are health-related. 

Meanwhile, machine learning is enabling 
researchers to mine petabytes of data for 
patterns and associations. The IBM Watson 
technology platform, for example, sifts 
through unstructured data using natural 
language processing and machine learning. 
Since April last year, IBM has been build- 
ing ‘knowledge experts’ for health using the 
Watson Health cloud platform: customers 
who store their data in the cloud are given 
access to algorithms that can help them to 
make sense of the data. IBM is already using 
Watson to tackle complex disease problems 
in collaboration with the University of Texas 
MD Anderson Cancer Center in Houston — 
specifically, how to use genomic information 
from cancer cells to provide individuals with 
targeted treatments. 

Even the rudimentary measurements 
already flowing from smartphones and 
wearable sensors can drive better out- 
comes than decades of medical-device 
development have been able to provide. 
For instance, in 2014, a woman with type 1 
diabetes wired together a tiny processor, 
an insulin pump and a continuous glucose 
monitor to automate the regulation of her 
blood sugar levels. For a small community 
of patients, the collective use of such ‘home- 
made’ systems has resulted in improve- 
ments that are well ahead of those provided 
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by devices and interventions emerging 
from conventional markets’. 

In short, actors big and small are waking 
up to the enormous profits that could be 
made from inexpensive consumer health 
data. Around ten years ago, a scattering of 
start-ups were obtaining people’s genomic 
information and other health data at low cost 
and making them available to pharmaceu- 
tical partners or other customers. In 2016, 
every major player in tech — Apple, Google, 
IBM, Facebook, even Uber — has signalled 
plans to enter this ‘digital health’ market. 


ACLOSED LOOP 
To be fair, harnessing advances in 
technology and analytics to radically 
improve health care is a principal motivator 
for many corporations. But it is telling that 
the early players in the game have seques- 
tered information in ‘closed loop’ systems. 
Take the wearable device Enlite, which 
was made available by the Dublin-based 
firm Medtronic to people with diabetes in 
2013. This sends insulin into the wearer's 
blood when a sensor detects a drop in glu- 
cose levels. Although patients can moni- 
tor their glucose levels at any instant, their 
aggregate records are not made accessible to 
them. And there is no mechanism by which 
patients or researchers outside the company 
can gain access to Medtronic’s tens of thou- 
sands of measurements. The same is true 
for other wearable devices produced by the 


same firm. In fact, the company has refused 
requests for patients’ own heart data (see 
go.nature.com/29tdzud). 

Even when corporations do give custom- 
ers access to their own aggregate data, built- 
in blocks on sharing make it hard for users 
to donate them to science. 23andMe, holder 
of the largest repository of human genomic 
data in the world, 


allows users to view “Of the 40,000 
anddownloadtheir 0 $0 searches 
own single-letter that are made 
DNA variantsand every second, 
share their data 2,000are 


with certain listed health-related.” 
institutions. But 
for such data to truly empower patients, 
customers must be able to easily send the 
information to their health provider, genetic 
counsellor or any analyst they want. 
Pharmaceutical firms have long seques- 
tered limited types of hard-to-obtain data, for 
instance on how specific chemicals affect cer- 
tain blood measurements in clinical trials. But 
they generally lack longitudinal health data 
about individuals outside the studies that they 
run, and often cannot connect a participant 
in one trial to the same participant in another. 
Many of the new entrants to health, unbound 
by fragmented electronic health-record plat- 
forms, are poised to amass war chests of data 
and enter them into systems that are already 
optimized (primarily for advertising) to make 
predictions about individuals. 


The companies 
jostling to get into 
health face some 
major obstacles, not 
least the difficulties 
of gaining regulatory 
approval for returning actionable infor- 
mation to patients. Yet the market value of 
Internet-enabled devices that collect and 
analyse health and fitness data, connect 
medical devices and streamline patient care 
and medical research is estimated to exceed 
US$163 billion by 2020, as a January report 
from eMarketer notes (see “The digital 
health rush’ and go.nature.com/29fbvch). 
Such a tsunami of growth does not lend 
itself to ethically minded decision-making 
focused on maximizing the long-term 
benefits to citizens. 

It is already clear that proprietary 
algorithms can replicate and exacerbate soci- 
etal biases and structural problems. Despite 
the best efforts of Google’s coders, the job 
postings that its advertising algorithm serves 
to female users are less well-paying than are 
those displayed to male users*. A ProPub- 
lica investigation in May demonstrated that 
algorithms being used by US law-enforce- 
ment agencies are likely to wrongly predict 
that black defendants will commit a crime 
(see go.nature.com/29aznyw). And thanks 
to ‘demographically blind’ algorithms, in 
several US cities, black people are about 
half as likely as white people to live in neigh- 
bourhoods that have access to Amazon’s 
one-day delivery service (see go.nature. 
com/29kskg3). 

We believe that closed-data and closed- 
algorithm business models in health — at 
scale — will hamper scientific progress by 
blocking the discovery of diverse ways to 
examine and interpret health data. Longer 
term, they could increase rather than 
reduce inequalities and injustices. It is not 
hard to picture a future in which compa- 
nies are able to trade people's disease pro- 
files, unbeknown to the patients. Or one 
in which health decisions are abstruse 
and difficult to challenge, and advances 
in understanding are used to aggressively 
market health-related services to people — 
regardless of whether those services actu- 
ally benefit their health. 


A 2011 protest in 
Berlin called for 
the right to keep 
personal digital 
data private. 


FOR BETTER, NOT WORSE 

It is not our intention to demonize technol- 
ogy companies. Closed systems have some 
innate short-term advantages over open 
ones. For instance, they can achieve coher- 
ence and scale faster than open systems can 
because their chief executives can dictate 
formats, standards and norms. But private 
capital will better serve public interests if 
at least some layers of the emerging health- 
research infrastructure are open (the raw 
data being one layer, the tools for analysis 
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another and the development of treatments 
and services for patients a third). 

In our view, sensor and other data and 
the methods for converting those data 
into clinically usable information should 
be a public good. The technical and busi- 
ness services that translate insights into 
value for individual citizens could well be 
closed. This would be analogous to Google, 
Apple Maps and Waze providing maps on 
the back of government-funded geospatial 
data. Under this scenario, private capital 
would fund the creation of meaningful 
patient experiences, not the creation of 
trade secrets about health. 

In theory, hard government regulation 
could prevent the collection and analysis of 
digital health data from becoming a high- 
profit business. The European Union has 
already passed sweeping directives to pro- 
tect people's digital information from being 
exploited for commercial or other purposes. 
But in the United States at least, the odds are 
slim that well-constructed digital civil-rights 
laws will be passed in the next 2-3 years. 

Many of the largest tech corporations have 
come to resemble small nations in their own 
right: they have enormous ‘natural resources’ 
(data and computing power) and global 
interests to pursue and protect. Just five 
US-based tech firms — Apple, Microsoft, 
Alphabet (Google's parent company), Cisco 
Systems and Oracle — had combined cash 
reserves of $504 billion in late 2015, much 
of which is held offshore to avoid taxation 
and regulation (see go.nature.com/29ep3zx). 
Even if the US government wanted to inter- 
vene, technologies and their accompanying 
business models are evolving faster than it 
can keep up. 

In our view, the creation of credible 
competitors that are open source is the most 
promising way to regulate closed business 
models. During the late 1990s, IBM, then 
one of the biggest players in the software 
market, abandoned its proprietary web 
server software in favour of selling services 
around open-source software. Once users 
realized that the open-source Apache web 
server and Linux operating system offered 
a viable alternative to commercial pack- 
ages, IBM started selling them its support 
and configuration services. At around the 
same time, the release of Netscape’s browser 
source code rapidly fuelled innovation in 
the browser market and prevented Micro- 
soft from creating a monopoly with Inter- 
net Explorer. In both cases, communities of 
hundreds, or at most a few thousand, trans- 
formed the landscape for the world’s largest 
corporations. 

The creation of public resources through 
government funding has a role in this. Soon 
after the publicly funded Human Genome 
Project announced in 2000 that a draft was 
complete, the private sequencing company 
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Celera stopped charging researchers for 
access to its data, deposited those data into 
the public database GenBank and focused 
instead on trying to develop treatments 
for disease. Yet the sums of money being 
directed towards public projects today — 
such as the Obama administration's Pre- 
cision Medicine Initiative, which aims to 
match treatments to patients’ genetic and 
physiological data — pale in comparison 
to the investment that many companies can 
bring. (23andMe’s latest fundraising round 
of $150 million represents 70% of the entire 
federal investment allotted to the Precision 
Medicine Initiative. ) 

What is needed are networks of open 
projects, combined with sufficient numbers 
of patients and citizens who are motivated 
to feed such projects with their health data. 


CASE STUDY 

At Sage Bionetworks, we are conducting four 
ongoing clinical studies on various diseases 
including Parkinson's. We are also providing 
the online interfaces for patients to upload 
their information, and portals to enable 
data sharing for a 


further three stud- “When it comes 
ies (on diabetes, to control over 
roa ee car our own data, 
rac nea) A088 health data 

these seven studies, 

must be 
we have enrolled 

where we 


more than 90,000 
participants since 
March 2015 (see 
go.nature.com/29qqk57). Participants have 
the automatic right to access and download 
complete copies of their own data. They 
also have the right to donate their data for 
broad reuse by ‘qualified researchers, mean- 
ing those that have validated their identity, 
passed a short test and signed a code of ethi- 
cal practice. 

In our studies, more than 75% of partici- 
pants elect to share their data — presumably 
to maximize the chances of investigators 
finding a way to help them and others 
like them. Our numbers are biased by the 
fact that we are surveying people who are 
already enrolled in clinical studies. But in 
a 2015 survey of a more generic US popu- 
lation by National Public Radio, 53% of 
those polled said that they would be will- 
ing to share their data anonymously with 
health-care professionals (see go.nature. 
com/29qr0b7). It could be transformative if 
even just 5% of the US population donated 
a copy of their health data to science. After 
all, it took only a small community of open- 
source software programmers worldwide to 
drive major shifts in the computing industry. 

In short, a movement not dissimilar to 
the environmental one is needed to break 
open how people's data are being used, and 
to illuminate how they could be used in 


draw the line.” 


the future. In the United States, it was the 
unified lobbying of small groups of activists 
in the 1960s and 1970s — each with differ- 
ent reasons for being concerned — that led 
to a series of groundbreaking federal ini- 
tiatives, such as the Clean Air Act and the 
creation of the Environmental Protection 
Agency. Likewise, at first, data donors may 
predominantly include those with personal 
incentives, or those who are philosophi- 
cally driven to share their health data, for 
instance through being part of the Quanti- 
fied Self community, which aims to use tech- 
nology to measure all aspects of our daily 
lives. Such early advocates for sharing could 
help to change norms by pushing for clearer 
messaging around consent and by raising 
awareness about what is at stake. 

Openness is not an easy goal. Numerous 
interlocking systems need to be designed, 
including those to protect privacy, to miti- 
gate harm caused by certain insights becom- 
ing public knowledge, and to enable people 
who do not wish to be phenotyped to opt 
out. But if the new era of digital health is 
accompanied by citizen-led pushes for more 
openness, it may not just be health care that 
is transformed. 

Citizens worldwide have too long a 
history of being passive players in health 
care — blindly following instructions from 
providers. And studies that have tracked 
reactions to revelations about global surveil- 
lance programmes suggest that most people 
are resigned to the idea that ownership and 
control of personal information is incompat- 
ible with the Internet age’. 

Yet just as social networking has rocketed 
around the world in a decade, a worldwide 
knowledge resource could soon be used to 
identify the best course of treatment for an 
individual on the basis of the experiences of 
millions. This resource will never be built 
unless each of us takes responsibility for our 
own health and disease, and for the informa- 
tion that we can generate about ourselves. 
When it comes to control over our own data, 
health data must be where we draw the line. = 
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An algal bloom in Stuart, Florida, in June led to a state of emergency. 


SE 


Study role of climate 
change in extreme 
threats to water quality 


Record-breaking harmful algal blooms and other 
severe impacts are becoming more frequent. We need 
to understand why, says Anna M. Michalak. 


ith concerns about climate 
‘extremes’ growing’, water is 
often the focus — either too 


much or too little. That is no coincidence: 
climate and the hydrological cycle are tightly 
coupled, and water is essential to ecosystems 
and societies. But it is not just the quantity of 
water that matters. So does its quality. 

Last year, Lake Erie, one of the US Great 
Lakes (which contain one-fifth of the world’s 
fresh surface water), experienced its biggest 
recorded harmful algal bloom. At its peak, 
the bloom spread some 200 kilometres 
across most of the lake’. Meanwhile, off the 
continent’s west coast, another record harm- 
ful bloom stretched from Baja California in 
Mexico up to Alaska, probably triggered by 
unusually warm water in the Pacific Ocean’. 


Both blooms were dominated by species of 
phytoplankton that produce powerful toxins. 

Such episodes can wreak havoc. During 
a previous bloom, in 2014, 500,000 people 
living near Lake Erie were ordered not to 
drink tap water, because it contained levels of 
hepatotoxins produced by the cyanobacte- 
rium Microcystis that were 2.5 times higher* 
than the World Health Organization's safe 
standard. The 2015 west-coast bloom of the 
diatom algae Pseudo-nitzschia shut down 
fisheries. The Dungeness crab fishery, one of 
the most valuable on the west coast, opened 
four months late owing to toxic levels of the 
neurotoxin domoic acid in the crabs. Brain 
damage in sea lions has also been docu- 
mented asa result of exposure to this toxin’. 

Impaired water quality is a global and 


growing problem, limiting resources for 
drinking, domestic use, food production and 
recreation, as well as harming ecosystems. 
The types and causes range from excess 
nutrients feeding harmful algal blooms 
and hypoxic ‘dead zones, to bacterial, viral 
and chemical contamination, to pollution 
by personal-care products and pharma- 
ceuticals. Cases of extreme impairment 
often lead to disproportionate human and 
ecosystem impacts. The costs can be huge. 
More than US$4 billion are lost each year in 
the United States alone as a result of harmful 
algal blooms®. 

Because the most severe water-quality 
impacts are exacerbated by weather, cli- 
mate plays a part. Runoff of nutrients from 
farmland spikes after heavy rains; warm 
temperatures accelerate the growth of bac- 
teria and phytoplankton. As climate change 
alters weather patterns and variability, con- 
ditions conducive to severe water impair- 
mentare likely to become more frequent. 

Yet there has been scant study of how 
climate will affect the occurrence of the 
extreme events that relate to water quality 
rather than quantity. We do not know how 
to relate water-quality extremes, their causes, 
their severity or their occurrence directly to 
changes in climate. It is time to plug this 
knowledge gap. 


COMPLEX CHAIN 

Scientific understanding of extreme storms, 
droughts and rising sea levels has improved 
markedly over the past decade. The impacts 
of extreme weather events are integral to 
discussions about climate-change mitiga- 
tion and adaptation. The expected rise in 
the frequency and severity of such events 
is well established, and even individual epi- 
sodes have been linked probabilistically to 
climate change’. 

Not so for water quality. Researchers 
have explored trends in water quality with 
climate, but the science of projecting and 
attributing the occurrence of extremes is in 
its infancy. This is despite evidence of strong 
links with climate. 

Regional studies reveal how multiple fac- 
tors often conspire to create conditions ripe 
for dire water quality. For example, summer 
toxic blooms in Lake Taihu, the third-largest 
freshwater lake in China, are more intense 
after tropical cyclones, because the associ- 
ated rains wash more nutrients into the lake, 
and the subsequent warmer temperatures 
and lower wind speeds further encourage the 
growth of blooms’. In 1999, a series of hur- 
ricanes triggered severe hypoxia in Pamlico 
Sound, North Carolina (part of the United 
States’ second-largest estuarine system) by 
delivering huge amounts of nutrients, organic 
carbon and fresh water to the estuary®. In 
North American regions as diverse as the 
Great Lakes, the east coast’s Chesapeake Bay 
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and the Gulf of Mexico, the severity of sum- 
mer hypoxia has been linked to the effects of 
weather on nutrient loading, water-column 
structure and water circulation. Although 
these examples all relate to harmful algal 
blooms and hypoxia, other types of water pol- 
lution are also affected by weather conditions, 
which are altered by climate change. 


THE CHALLENGE 

Several factors explain the paucity of 
research. First, the role of climate is difficult 
to pinpoint, because changes in water quality 
result from a delicate and complex interplay 
of human activities across local, regional and 
global scales. Complex chains of causative 
steps must be understood. These start with 
how climate change affects factors such as 
precipitation, temperatures and wind patterns 
for given regions and watersheds. Next, we 
must understand how these conditions alter 
the flow of water, nutrients, contaminants and 
other constituents to water bodies. Finally, we 
need to assess how these inputs, combined 
with meteorological conditions that influ- 
ence freshwater and coastal systems directly, 
will change water quality. We do not yet know 
how to put the pieces of this puzzle together. 

For example, the harmful algal blooms in 
Lake Erie are driven by excess phosphorus 
from changing farming and land-manage- 
ment practices in the region, but a summer 
2011 bloom shattered previous records. A 
forensic analysis’ revealed that a series of 
intense rainstorms led to record springtime 
discharge from rivers, which flooded the lake 
with a record amount of nutrients flushed 
from fields. Warmer-than-average summer 
temperatures and low winds then accelerated 
the growth of buoyant Microcystis cyanobac- 
teria. Unusually weak water circulation dur- 
ing the summer kept nutrients in the lake for 
longer, further feeding the bloom. How cli- 
mate change influences the occurrence of all 
these factors, individually and together, needs 
to be elucidated to predict the likelihood of 
similar events happening more frequently in 
the future. 

Second, water-quality and climate 
scientists work in disciplinary silos, and 
each tends to have a different scale of focus. 
Whereas much of climate science is global 
or concerned with large regions, most 
hydrologists and limnologists study pro- 
cesses in individual streams, lakes, water- 
sheds or estuaries. Similarly, water-quality 
impacts have been treated mostly as local 
or regional issues, resulting from human 
activities in a particular basin or watershed. 
Little attention has been given to the local 
impacts of human action at global scales. 
For example, strategies for preventing 
harmful blooms and hypoxia often rightly 
focus on limiting nutrient loads through 
land management, but should also con- 
sider how a shifting climate will alter local 
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meteorology, and thus nutrient loads. 

A clear understanding of the interplay 
between climate and severe water-quality 
impairment events is predicated on tracking 
cause and effect across a cascading range of 
scales, from the globe to individual water- 
sheds and from decades to days. 

Third, observational evidence to under- 
pin a global view of this interplay is lacking. 
Unlike for weather variables such as temper- 
ature and precipitation, no global network 
tracks water quality. Existing monitoring 
of water quality is sparse in space and time, 
and site-specific. Satellite-based observa- 
tions could expand coverage, but there are 
no widely accepted 


approaches for “Thereis 
doing so. There is disagreement 
even disagreement about which 
about which vari- variables best 
ables best capture capture water 
water quality. For quality. ay 


example, is the 

severity of a harmful algal bloom best rep- 
resented by its area, the total mass of phyto- 
plankton it contains, the amount of toxins 
that it produces or the ecosystem and human 
impacts that it engenders? Each brings a dif- 
ferent observational challenge. 


NEXT STEPS 

Researchers need to assess which meteor- 
ological conditions, in what combination, 
make extreme water-quality impairments 
most likely. In doing so, they must also con- 
sider land use, land management, popula- 
tion distributions and other regional factors 
that compound the effects of weather. An 
initiative such as Future Earth, which pro- 
vides a research platform for global sustain- 
ability science, would be a good umbrella 
for developing and integrating such knowl- 
edge globally. 

The first step should be a retrospective 
analysis of past extreme events, to under- 
stand commonalities and differences across 
types of systems and impacts. The resulting 
conceptual model will differ depending on 
the type of problem — hypoxic dead zones 
and microbial outbreaks in recreational 
waters will not link to climate in the same 
ways. The water-quality and climate research 
communities will have to work together to 
identify key mechanisms and feedbacks. 

Understanding meteorological drivers 
would allow researchers to assess whether 
climate models can accurately represent the 
occurrence of key specific conditions (such 
as high precipitation or warm temperatures), 
as well as their joint or sequential occurrence 
(such as high precipitation followed by low 
winds and high temperatures). This could 
be explored, for example, in the context of 
the Coupled Model Intercomparison Project 
Phase 6 (CMIP6), and effort to compare cli- 
mate projections under different scenarios 


that is under way in anticipation of the 
next Intergovernmental Panel for Climate 
Change assessment. 

A clear conceptual model would also 
inform observational needs. Observations 
must capture the severity of extreme events, 
their impacts and key variables for assess- 
ing the links to climate change. There will 
be trade-offs between specificity and cover- 
age. For example, whereas satellites might 
monitor some water-quality parameters and 
impacts globally, other key indicators such 
as toxin concentrations can be tracked only 
in situ. The GEO (Group on Earth Observa- 
tions) Aqua Watch initiative would be a natu- 
ral forum for exploring these challenges’”. 

As the science improves, its implications 
must inform broader global discussions 
around water, such as the strategy for meeting 
the United Nations’ Sustainable Development 
Goal of ensuring ‘availability and sustainable 
management of water and sanitation for all; 
one of 17 such goals adopted last year. 

The scientific community has made 
remarkable progress in understanding the 
role of climate in the occurrence and inten- 
sity of droughts, storms and other extreme 
events relating to water quantity. It is time 
for a similar examination of extremes in 
water quality. = 
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CORRECTION 

The Comment ‘No wild east’ (D. Sipp 
and D. Pei Nature 534, 465-467; 2016) 
incorrectly cited (in ref. 4) and referred to 
the 2001 Chinese national guidelines on 
assisted reproduction when discussing 
the implantation of modified human 
embryos for reproductive purposes. 
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Genetically modified pet pigs: is tinkering with an animal’s genome to stunt its growth more ethically dubious than conventional breeding for small size? 


GENE EDITING 


Running with scissors 


John Harris probes a study on the science and ethics of genome editing. 


pharmacologist John Parrington has pro- 
duced a veritable compendium of games 
that scientists like him can play with life itself. 
He invites us to imagine the potential of life 
forms “whose very genetic recipe was manu- 
factured in a chemistry lab using new compo- 
nents never seen before on Earth”. What larks! 
What follows is a thorough and com- 
prehensive account of the methodologies 
for altering life that have been or are being 
developed, and the directions that they may 
take in future. Those methodologies include 
the insertion or deletion of genes, the engi- 
neering of synthetic genes and the design of 
creatures unprecedented in nature. As Par- 
rington shows, many of the technologies are 
familiar: for example, designing immunity to 
disease through vaccination, or animal and 
plant breeding. He ends with the concept ofa 
“redesigned planet’, replete with new types of 
people, as well as designer babies, pets, plants 


lE Redesigning Life, molecular 
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and drugs. Invoking 
the catchphrase of 
comic-book super- 
hero Spiderman, “with 
great power comes 
great responsibility’, 
he touches on the chal- 
lenges that such a pos- 
sibility would entail. 


Redesigning Life: However, Par- 


How Genome rington’s way of deal- 
Editing Will ing with the ethical 
Transform the issues raised by the 
nord technologies that he 
JOHN PARRINGTON leefull 1 . 

Oxford University ay ae : y SAP Oneal 
Press: 2016. rather limited. Three 


examples give a sense 
of the nature of this problem. 

Early on, he notes that in agriculture, it 
is important to ensure that genome editing 
benefits the majority of humanity rather than 
stuffing the coffers of vast corporations. But 


he fails to say how this might be achieved. 
Given that we cannot ensure this for any 
product or service in any field, I wonder what 
this criticism amounts to? Attempting global 
distributive justice is a major political prob- 
lem of moral significance, but really, only 
governments can approach such utopian 
ideals. Demanding the impossible can never 
be rational ethics. 

Later, Parrington worries about designer 
babies engineered for looks, intelligence or 
extraordinary talent (opining, “Such fears run 
deep among scientists”). But he offers not a 
word about the cogency of the fears, or about 
the moral basis — or lack of it — for how peo- 
ple see things. Parrington also questions the 
use of stem-cell tech- 
nologyin helping older NATURE.COM 
women to have chil- _ Formoreonscience 
dren: “Would this be _ in culture see: 
seenas liberating oran nature 
irresponsible extension 
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of a woman's reproductive age?” However, 
how it would be seen is a sociological ques- 
tion. The ethical responsibility involves show- 
ing how it should be seen, and why. 

If Parrington is serious about the need to 
address ethical issues when tinkering with 
genes, more explanation is needed of just 
what the ethical issues are and how they 
might be resolved. Such an attempt involves, 
at the very minimum, an analysis of the types 
of design involved in, for example, produc- 
ing designer babies; it also involves giving 


i . reasons to morally 
“Why might it be prefer some sorts 


better toincrease of design or ways 
intelligence by of designing to oth- 
education, diet ers, as I explain in 
orexercisethan my book How to 
through gene be Good (Oxford 
editing 22 University Press, 
2016). Why might it 
be better to aim to increase cognitive powers 
and perhaps even intelligence by education, 
diet or exercise than through gene editing 
or drugs? One would also need to identify 
elements that would clearly be unethical to 
design into a person, such as an increased 
propensity to disease or premature death. 
Most importantly, one would need to 
consider why attempts at design are morally 
worse (if they are) than simply leaving things 
to the genetic lottery of sexual reproduction. 
There is a story that in the early twentieth 
century, the pioneering modern dancer 
Isadora Duncan suggested to writer George 
Bernard Shaw that they should have a child, 
surmising that with her looks and his brains 
any progeny would have huge advantages. 
The ever-rational Shaw responded, “But 
what if it had your brains and my looks?” Was 
Duncan's proposal unethical or just miscon- 
ceived? What would or should have made 
such a proposal ethically problematic? And 
if it was not ethically problematic, why might 
more ‘techie’ attempts become unethical? 
Gene editing is now the stuff of do-it- 
yourself ‘garage research, opening up night- 
mares for regulation and oversight. Our best 
hope is to clarify what it is morally permis- 
sible to do behind closed doors. I have no 
quarrel with the idea of redesigning our 
planet, or indeed ourselves, if the elements 
of design promote life, liberty and the pur- 
suit of happiness for those creatures to whom 
such things matter. These objectives are not 
only ethical, but mandatory ifintelligent life 
forms are to survive into the far future. The 
best way of ensuring that this is allowed to 
happen is to be clear about which are and 
which are not morally permissible means to 
this clearly desired and desirable end. m 
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University of Manchester, UK. 

e-mail: john. harris@manchester.ac.uk 


© 2016 Ma 


Books in brief 


Rise of the Machines: A Cybernetic History 

Thomas Rid W. W. NORTON (2016) 

Cybernetics, the science of communication and control in machines 
and organisms, has had a tangled history. Here, professor of war 
studies Thomas Rid unravels seven strands of it, starting with 
mathematician Norbert Wiener’s book Cybernetics (1948). Wiener’s 
big idea first gained traction in post-war research on ballistic 
calculations in airborne warfare. It then popped up in 1950s dreams 
of automated utopias, the concept of benevolent machines serving 
humanity in the 1970s and, from the 1980s, applications such as 
virtual reality, public-key cryptography and, ominously, cyberwar. 


Margherita Sarrocchi’s Letters to Galileo 

Meredith K. Ray PALGRAVE (2016) 

When astronomer Galileo Galilei was a rising star, he corresponded 
with the formidable Margherita Sarrocchi — a salon host, epic poet 
and polymath versed in geometry, natural philosophy and logic 

(J. Rampling Nature 520, 154-155; 2015). In this rich analysis of 
the exchange, Meredith Ray shows how the pair found common 
ground in assessing each other’s work, whether the discovery of 
Jupiter’s satellites or the crafting of poetic nuance. An illuminating 
explication of the dynamics in early-modern arts and sciences, 
complemented by the first full English translation of the letters. 


Fighting Fit: The Wartime Battle for Britain’s Health 

Laura Dawes WEIDENFELD & NICOLSON (2016) 

Historian Laura Dawes digs into that other victory of the Second 
World War: public health in Britain. There had been dire predictions, 
such as epidemics incubated in air-raid shelters. But by war’s end, 
UK rates of almost all infectious diseases had dropped, thanks to 
the Medical Research Council, Nobel laureates such as physiologist 
Andrew Huxley, hordes of researchers and a willing public. Dawes’ 
sparky account demonstrates how that rare teamwork advanced 
emergency care, preventive medicine, the treatment of insect-borne 
disease and, ultimately, the formation of the National Health Service. 


= Invisible Influence: The Hidden Forces that Shape Behavior 
shot men forces Jonah Berger SIMON & SCHUSTER (2016) 


‘ge In his best-seller Contagious (Simon & Schuster, 2013), marketing 
INVISIBLE scholar Jonah Berger explored influence through word of mouth — 
INFLUENCE a territory much probed by social psychologist Robert Cialdini 
| (T. Dietz Nature 479, 176; 2011). Here, Berger turns the tables, 
| JONA | examining the hidden social influences that nudge us into choosing 
BERGE | everything from jobs to desserts. Neatly compressing findings in 


ethology, economics and more, he looks at imitation, differentiation, 
motivation and the see-saw between novelty and familiarity. 
Crowded with insights — not least, for wary voters. 


Peacock and Vine: Fortuny and Morris in Life and at Work 

A. S. Byatt CHATTO & WINDUS (2016) 

In this faceted germ of a book, novelist A. S. Byatt muses on design 
revolutionaries Mariano Fortuny and William Morris. Fortuny, galvanized 
by ancient Minoan fabrics, printed scarves with patterns of murex shells 
and trilobite fossils, and devised intricate, permanent pleating for his 
sinuous ‘Delphos’ gowns. Morris recorded the “geometry of branches 
and petals and fruits”, bringing hedgerow and copse indoors with 
motifs and natural dyes that transformed typography, wallpaper and 
textiles. An ingenious comparison. Barbara Kiser 
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Correspondence 


Goals change crowd 
air chemistry 


During live public screenings 

of the 2016 UEFA European 
Championships, the emission 
rates of particular chemicals 

in the audience's breath vary 
sharply — apparently in response 
to events on the football pitch. 

Football matches induce fans 
to roar in jubilation, hold their 
breath in suspense and sigh with 
disappointment. On 26 June, we 
tracked reactions from a cinema 
audience during the Germany- 
Slovakia game by monitoring 
changes in air composition 
resulting from their exhalations 
(for methodology, see J. Williams 
et al. Sci. Rep. 6, 25464; 2016). 

In moments of high 
excitement, exhaled carbon 
dioxide seems to spike as people's 
heartbeats and breathing 
accelerate (see ‘Breath chemistry 
of football fans’). So do emission 
rates of isoprene, which is 
released from muscles as fans 
spring from their seats when a 
goal is scored. Breath chemistry 
therefore appears to ride the 
same emotional roller coaster as 
the live broadcast. 

Christof Sténner, Jonathan 
Williams Max Planck Institute 
for Chemistry, Mainz, Germany. 
jonathan. williams@mpic.de 


Journals should drive 
data reproducibility 


Peer-reviewed journals — as well 
as researchers and their funders 
— must take responsibility for 
improving the reproducibility of 
published results (see Nature 533, 
452-454; 2016). 

I suggest that journals should 
be required to sign a global 
statement indicating that, to the 
best of their knowledge, the data 
that they publish are reproducible. 
This statement would be 
collaboratively formulated by the 
editors-in-chief in accordance 
with recommendations from 
the International Committee 
of Medical Journal Editors and 
guidelines proposed by the US 


BREATH CHEMISTRY OF FOOTBALL FANS 


People exhale bursts of carbon dioxide and isoprene whenever a goal is scored. 
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National Institutes of Health, 
Nature and Science (see Nature 
515, 7; 2014 and go.nature. 
com/29bxphv). 

Journals would then publish 
only papers that are accompanied 
online by full experimental 
protocols, raw data and source 
code, as in the Protocol Exchange 
repository (www.nature. 
com/protocolexchange). For 
manuscripts containing statistical 
analyses, journals should peer 
review only those papers that 
use statistics environments 
based on source code, enforcing 
the ban on ‘point-and-click 
statistical software (see go.nature. 
com/29pdpcl). 

Gregorio Santori University of 
Genoa, Italy. 
gregorio.santori@unige. it 


Don’t mar legislation 
with pseudoscience 


Weare concerned that some of 
the European Union's processes 
for setting safety regulations for 
chemicals are being influenced 
by media and pseudoscience 
scaremongering. Pseudoscience 
has no place in such decisions, 
which should be based purely 
on well-defined and transparent 
evidence. 

For example, endocrine 


disruptors are being blamed 

for obesity and type 2 diabetes 

(J. Legler et al. J. Clin. Endocrinol. 
Metab. 100, 1278-1288; 

2015) despite the absence of 
supporting evidence for this, 
and despite food and sugar 
over-consumption being 
established as a proven cause. 

Asa consequence, the European 
Commission's criteria for 
regulating endocrine-disrupting 
compounds asa threat to human 
health are based on correlational, 
not causal, studies (see go.nature. 
com/29rjlik). 

Conflicts of interest can 
contribute to the problem, 
beyond the commercial 
motivation of industry. 

Some non-governmental 
organizations might need to 
maintain public concerns to 
boost charitable donations. 
Decision-makers might prefer to 
disregard evidence-based data 
that contradict a precautionary 
viewpoint. And some scientists 
put securing research funds 
above objective appraisal of the 
evidence. 

Acting on hazard 
identification alone relieves 
the scaremongering party of 
the burden of proof, when 
harm is simply assumed. As a 
result, regulations can become 
unnecessarily restrictive. They 


may even be damaging, for 
example if an agricultural ban 
were to be imposed on triazole 
fungicides because of their 
endocrine-disrupting potential. 
The risk to humans at such levels 
of exposure would be negligible 
(J. E. Chambers et al. Crit. Rev. 
Toxicol. 44, 176-210; 2014). 

It makes no sense to override 
such evidence with a blanket 
ban on potentially hazardous 
chemicals that ignores the 
public’s demonstrable low level 
of exposure. 

Daniel R. Dietrich* University of 
Konstanz, Germany. 
daniel.dietrich@uni-konstanz.de 
*On behalf of 8 correspondents (see 
go.nature.com/29kbyqb for full list). 


Drug repositioning 
needs a rethink 


Repurposing drugs to treat 
illnesses for which they were not 
originally intended can be faster 
and cheaper than developing 
new ones (see Nature 534, 
314-316; 2016). I suggest that 
greater improvements would 
come from testing different 
drug combinations, rather than 
relying only on high-throughput 
screening of generic or failed 
drugs. 

Disease is often an integration 
of multiple pathologies (see, for 
example, J. N. Weinstein et al. 
Nature 507, 315-322; 2014), so 
these are potentially treatable 
with different drug combinations 
that act in synergy. Such 
combinations often show better 
efficacy than single treatments, 
have fewer side effects and 
are less likely to result in drug 
resistance (see Nature 492, 
118-122; 2012). 

For commercial reasons, 
pharmaceutical firms tend to 
dismiss reposition testing of drugs 
that are off patent. I therefore 
suggest that governments step 
in to fund the repurposing of 
established drugs to broaden the 
search. 

Xianting Ding Shanghai Jiao 
Tong University, Shanghai, China. 
dingxianting@sjtu.edu.cn 
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OBITUARY 


Robert Treat Paine 


(1933-2016) 


Ecologist who established concept of keystone species. 


dominated by descriptions of 

patterns in nature and assertions 
that these patterns were determined 
by physiological tolerances, energy 
flows through ecosystems or compe- 
tition among similar species. 

Robert Treat Paine changed the 
field’s course with a simple experi- 
ment. He removed ochre starfish 
(Pisaster ochraceus) from a seashore 
in Washington state, revealing that 
a single predator could control the 
abundance, diversity and distribution 
of other organisms sharing its ecosys- 
tem. Thus was born Paine’s concept of 
a ‘keystone species’ — one that influ- 
ences most of the other species in its 
community, just as the wedge-shaped stone at 
the top ofan arch holds the structure in place. 

Paine, who died on 13 June, was born 
in Cambridge, Massachusetts, in 1933; his 
mother was a photographer and writer, his 
father an art historian. As a boy, he spent 
hours exploring the woods, acquiring a feel 
for the rhythms of nature and honing his 
powers of observation. 

Paine developed a passion for palae- 
ontology as an undergraduate at Harvard 
University in Cambridge. But his plan to 
study fossils as a graduate — at the University 
of Michigan in Ann Arbor — was derailed 
after he started attending classes taught by 
ecologist Frederick E. Smith. Smith was una- 
fraid to ask provocative questions about pat- 
terns in nature. Switching to ecology, Paine 
studied a living fossil: a species of lamp shell, 
or brachiopod, for his 1961 PhD. 

Following a postdoctoral fellowship at the 
Scripps Institution of Oceanography in La 
Jolla, California, in 1962, Paine joined the 
zoology faculty at the University of Wash- 
ington in Seattle. There he conducted his 
now-classic starfish experiments, developed 
other transformative ideas and established a 
dynasty of experimental ecologists up to and 
beyond his official retirement 36 years later. 

Paine’s studies at Michigan coincided with 
the development of an avant garde idea in 
ecology. Challenging conventional wisdom 
that the availability of prey regulated preda- 
tors, the zoology faculty troika of Smith and 
his colleagues Nelson Hairston and Lawrence 
Slobodkin proposed in 1960 the ‘green world 
hypothesis. In their view, the world was 
green because herbivores, who could other- 
wise consume all plant (green) matter, were 


E the 1960s, ecology was 
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regulated by their predators (N. G. Hairston 
etal. Am. Nat. 94, 421-425; 1960). The paper 
attracted considerable attention, but like all 
conceptual and most field ecology at the time, 
lacked solid field evidence. 

Enter Paine’s starfish. Paine demonstrated 
conclusively that, at least on one rocky shore, 
the top predator both controlled its prey and 
affected most of the other species in the com- 
munity (R. T. Paine Am. Nat. 100, 65-75; 
1966). The ochre starfish is a generalist preda- 
tor, consuming barnacles, snails, limpets and 
more. Its favourite food is the California mus- 
sel — the dominant competitor for space on 
the rocks. By controlling mussel numbers, 
starfish enable a wide diversity of species to 
co-exist, including other prey species, and 
an array of seaweeds, sponges and anemones 
that the starfish do not consume. When Paine 
removed the stars from the system, mussels 
quickly crowded out other species. Compa- 
rable dynamics were later demonstrated for 
keystone species in other marine, terrestrial 
and freshwater ecosystems. 

Paine expanded these ideas to include the 
concept of ‘trophic cascades’ — the rippling 
through a community of strong, top-down 
effects that affect multiple ecosystem levels 
and species. These ideas upended estab- 
lished theories about the dominance of 
‘bottom-up’ effects, such as those mediated 
by changes in rainfall or nutrients. Experi- 
mental evidence is hard to argue against, 
and it was clear from Paine’s experiments 
that single species could matter. 

His findings had a major effect on con- 
servation. Sharks, for example, are now 
understood to have key roles in controlling 
the distribution, abundance and diversity of 


many species in ocean ecosystems. 

Paine delighted in romping around 
the rocky intertidal zone, especially at 
his long-term study site on Tatoosh 
Island off the Olympic peninsula of 
Washington. He was in his element 
in the field — steering his outboard 
motor boat through swells, navigat- 
ing his two-metre frame over slip- 
pery algae, hauling supplies up steep 
cliff faces, peppering everyone with 
observations, and generally outpac- 
ing students decades younger than 
him. He also took intense pleasure in 
devising ways to adapt hardware sup- 
plies to manipulate the abundance of 
species with different characteristics. 
A home-made fence or cage had to 
achieve its intended ecological purpose as 
well as withstand pounding waves and storm 
surges of a not-so-pacific ocean. 

Bob’s legacy is defined by his charisma 
and his brilliant mentoring of generations 
of ecologists, as much as by his research. 
He kindled curiosity, independent thinking 
and a willingness to observe closely and then 
manipulate nature to discern her secrets. 
Even after retiring, Bob actively supported 
his ever-expanding family of academic (as 
well as biological) children, grandchildren 
and great-grandchildren. 

Now recognized as one of the greatest 
ecologists in history, Bob was not afraid to be 
provocative, but understood that what mat- 
tered were results — especially experimental 
ones. His towering stature easily intimidated 
students until they spent time with him and 
discovered an encouraging and inspiring 
mentor and often a friend. 

Even early on, another giant in ecology, 
Robert MacArthur, who in the 1960s had 
championed the importance of competi- 
tion in structuring ecological communities, 
understood how profound Bob’s keystone- 
species findings were. Writing to him three 
months after the initial starfish results were 
published, MacArthur stated simply: “This 
changes everything.” And so it did. m 


Jane Lubchenco is distinguished professor 
of integrative biology at Oregon State 
University, Corvallis, Oregon, USA, and the 
first US Science Envoy for the Ocean. She 
switched fields in graduate school to work 
with an ecologist who opened her eyes and 
expanded her world: Bob Paine. 

e-mail: lubchenco@oregonstate.edu 
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CLIMATE SCIENCE 


Cooling in the Antarctic 


The Antarctic Peninsula has been warming for many decades, but an analysis now reveals that it has cooled since the late 
1990s. Inspection of the factors involved suggests that this is consistent with natural variability. SEE LETTER P.411 


ERIC J. STEIG 


he Antarctic Peninsula is a triangular, 
| mountainous land with a coastline 
of dramatic, calving glaciers and rich 
wildlife, and it exemplifies the popular image 
of Antarctica (Fig. 1). Over the past half 
century, it has been one of the most rapidly 
warming places on Earth. This warming is 
associated with major physical and biologi- 
cal changes, including a decline in the Adélie 
penguin population’ and the disintegration 
in 2002 of a large portion of the Larsen B ice 
shelf, a major geographical feature that had 
existed for millennia’. It is natural to view the 
changes at this northernmost point in Ant- 
arctica as part of the inexorable southward 
march of anthropogenic climate change. It 
may thus seem remarkable that Turner and 
colleagues* now report (page 411) that the 
Antarctic Peninsula has actually cooled in 
the past two decades. 
Turner et al. show that the average tem- 
perature on the peninsula has decreased by 


about 0.5°C per decade since the late 1990s, 
about the same rate at which it warmed in the 
preceding five decades. To understand this 
apparent reversal, one has to recognize that 
the average warming trend observed since the 
middle of the twentieth century reflects several 
aspects of Antarctic Peninsula climate. Which 
aspect dominates varies by both location and 
season. For example, the oft-heard claim that 
the peninsula is the fastest-warming place 
on Earth is accurate only during the winter, 
as recorded at the Faraday/Vernadsky station 
on the west coast. Winter warming at this sta- 
tion — more than 6 °C since records began in 
1947 — is unambiguously associated with a 
rapid decrease in sea ice’. 

By contrast, significant summer warming 
has been largely restricted to the peninsula’s 
east coast and is related to poleward migra- 
tion of the circumpolar westerly winds, 
commonly described by an increase in the 
Southern Annular Mode (SAM) index’. The 
SAM is a mode of natural atmospheric vari- 
ability, but changes in the SAM index have 


been associated with increased greenhouse 
gases and the decline of stratospheric ozone’. 
Autumn is the only season during which both 
coasts of the Antarctic Peninsula have warmed 
together over multiple decades’. The autumn 
temperature trend has been associated with 
features of large-scale atmospheric circulation 
known as Rossby waves, which link changes in 
tropical convection (associated with El Nifio 
ocean-warming events, for example) with 
temperatures on the Antarctic Peninsula’. 
Turner et al. show that most of the factors 
that determine the peninsula’s temperatures 
have changed in such a way as to favour 
regional cooling, rather than warming, since 
the end of the twentieth century. They find 
that there has been no increase in the SAM 
index since the late 1990s, but that there have 
been stronger easterly and southeasterly near- 
surface flows towards the peninsula, counter- 
ing the long-term trend of increasing westerlies 
during the summer. Because southeasterlies 
tend to push sea ice towards the peninsula’s 
east coast, there has been greater sea-ice cover 


Figure 1 | Glacier ice at James Ross Island, Antarctic Peninsula. Turner et al.’ report that the peninsula has cooled during the past 20 years. 
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TOM TOBIN 


there during the past two decades, reducing 
the flux of heat from the ocean and further 
amplifying the cooling effects of the changes 
in atmospheric circulation. 

Moreover, the authors show that the average 
state of the tropical Pacific Ocean has changed, 
with the dominance of strong El Nifio events 
during the 1990s giving way to colder, pre- 
dominantly La Nifia conditions during the 
2000s. Although the relationship between the 
state of the tropics and Antarctic Peninsula 
temperatures is complex, the net result dur- 
ing autumn and winter has been increased 
easterlies and southeasterlies, similar to those 
that occur during summer. The changing state 
of the tropical Pacific similarly explains why 


Climate models” suggest that the signal 
of anthropogenic climate change is damped 
by the tendency of mean ocean transport to 
carry heat northward. This, combined with 
the large magnitude of natural variability, sug- 
gests that anthropogenic climate change may 
not be unambiguously detectable in Antarc- 
tica for several more decades. Many decades 
of data are required to meaningfully charac- 
terize decadal climate variability. Looking at 
surface-air temperatures’’ collected for more 
than 60 years, the long-term trend is one of 
warming. Importantly, the authors do not 
claim that this long-term warming has ended, 
and it would indeed be premature to come to 
such a conclusion. = 


temperatures over central West Antarctica 
south of the peninsula — increased markedly 
through the 1990s, but have since remained 
relatively stable’. 

A crucial point made by Turner et al. is that 
both the warming since the 1950s and the cool- 
ing since the late 1990s are entirely consistent 
with natural climate variability. This is to be 
expected: none of the known drivers of Ant- 
arctic Peninsula temperature change can be 
clearly associated with anthropogenic effects, 
apart from the summertime trend in the SAM 
index. Although recent changes in the peninsu- 
las climate have been large, the natural decadal- 
scale variability is also large, making short-term 
fluctuations inherently unpredictable even in 
the presence of strong forcing. 

Even before Turner and colleagues’ analysis, 
there was little evidence that the rapid warm- 
ing in Antarctica falls outside the range of 
natural variability. Indeed, the palaeoclimate 
record from ice cores has strongly suggested 
that it does not*” (with the possible exception 
of summer warming on the northernmost part 
of the peninsula, at James Ross Island’’). In 
short, Turner and co-workers’ findings should 
not be surprising. 

A potentially controversial aspect of the 
new findings is the authors’ conclusion that 
the observed peninsula cooling is independ- 
ent of the recent period of relatively modest 
global warming, sometimes dubbed the hiatus. 
An objective choice for the start of the cooling 
period is 1998, the year commonly cited for the 
start of the hiatus. Although several explana- 
tions for the hiatus have been proposed, there 
is little doubt that exchange of heat between 
the atmosphere and the tropical ocean has a 
major role. This heat exchange is sometimes 
discussed in terms of a negative phase of a 
phenomenon called the Interdecadal Pacific 
Oscillation (IPO, detected as warm or cool sur- 
face waters in the Pacific)’. Turner et al. argue 
that negative phases of the IPO index tend to 
increase temperatures on the Antarctic Penin- 
sula, which implies that the observed cooling is 
not connected to the hiatus. But relationships 
between tropical dynamics, global tempera- 
tures and Antarctic climate aren't likely to be 
captured by a single index. 


Eric J. Steig is in the Department of Earth 
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beginning for RNA 


Organic molecules called coenzymes are central to metabolism, but have 
also been found to act as components of RNA in bacteria. A study reveals how 
coenzymes are incorporated into RNA. SEE LETTER P.444 


KATHARINA HOFER & ANDRES JASCHKE 


he absence of a molecular cap structure 

at the start of messenger RNA mol- 

ecules has long been considered to be 
a hallmark of bacteria’. However, in the past 
decade”’, chemical modifications have been 
discovered in bacterial RNA that structur- 
ally resemble the caps found in mRNA from 
eukaryotes (organisms whose cells contain 
a nucleus). These modified building blocks 
are coenzymes, which are well known to bio- 
chemists for their role as helpers in enzyme- 
catalysed meta- 


bolic reactions. “The study 

On page 444, Bird provides afirst 
et al.’ report that the explanation for 
bacterial capping how coenzyme- 
mechanism is fun- modified RNA ; 
damentally different 1S synthesized in 
from that of eukary- bacteria.” 


otes. The authors 

then extend their study to unravel why some 
RNA sequences carry this cap, whereas others 
do not. 

RNA is generally composed of only four 
building blocks — the nucleoside molecules 
adenosine, cytidine, guanosine and uridine — 
linked by phosphate groups. However, many 
site-specific chemical modifications decorate 
RNA molecules, supporting diverse coding, 


© 2016 Macmillan Publishers Limited. All rights reserved. 


structural and catalytic functions’. The cap at 
the 5’ end of mRNA is one of the most promi- 
nent eukaryotic modifications, influencing 
processes as diverse as RNA splicing, intra- 
cellular transport and protein biosynthesis’. 
The attachment of this cap occurs in parallel 
with RNA transcription, thanks to the com- 
bined efforts of different enzymes’ (Fig. 1a). 

First, the enzyme RNA polymerase (RNAP) 
binds to a specific sequence called the pro- 
moter in DNA, and initiates RNA synthesis 
immediately downstream of this, at the tran- 
scription start site (denoted as the +1 position). 
RNAP assembles the RNA chain complemen- 
tary to the DNA template strand by linking 
nucleosides that have three phosphate groups 
attached (nucleoside triphosphates; NTPs), 
each linkage causing the concurrent release 
of two phosphate groups. The RNA carries a 
triphosphate group at its leading 5’ end, stem- 
ming from the NTP incorporated at the +1 
position. Once the growing RNA chain reaches 
acertain length, a cap structure is added to this 
end: enzymes first clip off a monophosphate 
group, then add a guanosine monophosphate 
molecule, and finally add a methyl group to 
the guanosine, thereby producing a mature 
cap structure. 

Bacteria do not possess this capping 
machinery, so textbooks say that their tran- 
scripts carry a 5’ triphosphate. But mass 
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spectrometry indicates*” that this 
is not wholly accurate. Analysis of 
chopped-down bacterial RNA revealed 
unexpected constituents that, owing to 
their structure, could reside only at the 
5’ end of RNA. These constituents were 
two old acquaintances, nicotinamide 
adenine dinucleotide (NAD*) and 
3'-dephospho-coenzyme A (dpCoA), 
central coenzymes in bacterial 
physiology. 

In 2015, the combination of a 
NAD*-specific enrichment proto- 
col and high-throughput sequence 
analysis revealed’ a set of bacterial 
regulatory small RNAs to have cap-like 
NAD* attachments that protect them 
from degradation and modification. 
An enzyme called NudC can remove 
the cap**, but it was not clear how 
NAD* got into these RNA strands in 
the first place. 

The sequence of all reported NAD*- 
RNA molecules starts with adenosine’, 
and NAD* and dpCoA contain an 
adenosine substructure. As a result, 
Bird et al. hypothesized that the coen- 
zymes compete with the regular NTP, 
adenosine triphosphate, in the ini- 
tiation step of transcription, therefore 
becoming incorporated into the RNA 
chain at the +1 position* (Fig. 1b). 
The authors’ cell-free transcription 
assays confirmed that bacterial RNAP 
can efficiently incorporate NAD”, its 
reduced form NADH, and dpCoA into 
full-length RNA transcripts. How- 
ever, incorporation is efficient only if 
the promoter sequence of the corre- 
sponding DNA is the same as those in 
the bacterium Escherichia coli that are 
transcribed to highly NAD*-capped 
RNAs’. 

Systematic investigation revealed that 
the promoter nucleoside immediately 
upstream of +1 (the nucleoside in the -1 
position) is a key determinant of coen- 
zyme incorporation, with the presence 
of adenosine at -1 conferring much 
higher levels of incorporation than 
cytidine. In a crucial experiment, the 
authors analysed the ability of various 
E. coli promoters to initiate transcrip- 
tion of NAD*-modified RNAs in vivo, 
and so demonstrated that these in vitro 
findings also apply in living cells. Fur- 
thermore, they showed that coenzyme 
capping results in a large increase in 
RNA stability in vivo. 


Finally, Bird et al. used X-ray crystal- 
lography to analyse the structural basis for 
coenzyme incorporation. They soaked crys- 
tals of an RNAP-promoter complex in a 
solution containing the coenzymes and NTPs, 
and observed that coenzyme capping could 


initiate RNA formation in the crystal. 
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Figure 1 | Capping mechanisms. a, During eukaryotic transcription, 


the RNA polymerase (RNAP) enzyme begins synthesizing RNA from 
DNA immediately downstream of the DNA promoter sequence, at 
position +1. RNAP links nucleosides — adenosine (A), cytidine (C), 
guanosine (G) and uridine (U), with U being incorporated in place of 
the DNA nucleoside thymidine (T) — that harbour three phosphate 
groups. The enzyme removes two phosphates as each molecule 

is added to the chain. After RNA reaches a certain length, other 
enzymes (not shown) cap the 5’ end: a phosphate group is removed, a 
guanosine monophosphate added and a methyl group added to that. 
b, By contrast, Bird et al.’ report that bacterial RNA is capped during 
the initiation of transcription. A coenzyme molecule (here, NAD") 
that contains an adenosine substructure competes with adenosine 
triphosphate for position +1 and is therefore directly incorporated at 
the 5’ end of RNA by RNAP. The nucleoside present at DNA position 
-1 determines the efficiency of coenzyme incorporation. 


The current study provides a first 
explanation for how coenzyme-modified 
RNA is synthesized in bacteria and demon- 
strates the fundamental differences between 
this process and typical eukaryotic capping. 
Furthermore, the work establishes the func- 
tional relevance of NAD* capping in vivo 
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— increasing RNA stability. The 
existence’ of the decapping enzyme 
NudC indicates that the stabilized 
transcripts can be destabilized and 
targeted for degradation if the meta- 
bolic need arises. 

It remains to be seen whether 
selective stabilization and degrada- 
tion are the only functions of these 
unconventional RNA ends. We 
recently proposed® that coenzymes 
and metabolites attached to RNA 
provide a layer of ‘epitranscriptomic’ 
information that modifies the activity 
of RNA without altering its sequence. 
Such information could endow RNAs 
with additional functions and facilitate 
their interaction with other biomole- 
cules. If this is the case, there must be 
ahigh level of specificity in the linking 
of a particular coenzyme to an RNA. 
Bird and colleagues’ study does not 
reveal whether RNAP can distinguish 
between different adenosine-contain- 
ing coenzymes in vivo. An interesting 
question is therefore whether there 
are other structural determinants of 
coenzyme incorporation besides the 
—1 nucleoside. It will also be interesting 
to see whether RNAP can incorporate, 
through similar mechanisms, uridine-, 
cytidine- and guanosine-containing 
biomolecules. 

Finally, Bird et al. demonstrated 
in vitro that eukaryotic RNAP can also 
incorporate coenzymes into RNA. 
Thus, this atypical capping might not 
be limited to bacteria. The diversity 
of cap structures and capping mech- 
anisms in eukaryotes may well be 
greater than currently anticipated. m 
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Fishing for lessons 
on coral reefs 


A global study has identified coral reefs with greater fish biomass than would be 
predicted given human and environmental pressures. These outliers might teach 
us something about sustainable coral-reef management. SEE LETTER P.416 
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reef-fish declines. In particular, there are fewer 
fish in locations with shorter travel times to 
large markets where people can sell fish. 

The effect of increasing market-based forces 
on decreasing fish biomass was stronger than 
the effects of many of the other factors that 
influence reef-fish biomass, including human 
population pressures, environmental condi- 
tions or local management actions. This result 
is crucial for coral-reef conservation because 
it suggests that market forces contribute to 
overfishing and that market-based interven- 


KRISTY J. KROEKER 


he human population is expected to 

reach 9.7 billion by mid-century’, with 

many of us concentrated in the nar- 
row, coastal zones of the world. The inevitable 
growing demand for food is of great concern 
to conservationists. Seafood is a key source of 
nutrition for people worldwide, yet unsustain- 
able fishing practices have already led to the 
degradation of many coastal ecosystems’, and 
these degraded ecosystems have a reduced 
capacity to support stable fisheries’. Strong 
fishing restrictions are in place for only 2% of 
the world’s oceans’, leaving the vast majority 
of marine ecosystems at risk from overfishing 
and continued degradation. On page 416, Cin- 
ner et al.” provide key insights into the social 
and economic conditions linked to declines in 
the biomass of coral-reef fish. 

Although there is a well-established link 
between overfishing and coral-reef health, 
we know less about the social, cultural 
and economic conditions that cause some 


communities to overfish while others fish more 
sustainably. Understanding the root causes of 
overfishing may allow individuals and organi- 
zations to address, and potentially alleviate, the 
conditions that contribute to overfishing, and 
to promote the socio-economic conditions 
associated with sustainable fisheries. 

Cinner et al. compiled ecological and 
environmental data for some 2,500 coral reefs 
worldwide, as well as the socio-economic 
data for the use of these reefs. The authors 
then compared how the biomass of fish on 
coral reefs varied with numerous social and 
economic factors, such as human population 
growth and levels of tourism, on both local 
and national scales, while also accounting 
for some environmental conditions (largely 
beyond human control) that influence reef 
fish populations. 

The authors find that conventional 
conservation approaches such as well-enforced 
reserves (areas that are not fished) generally 
increase fish biomass, but they also describe 
other socio-economic factors that lead to 


tions — such as eco-labelling or creating mar- 
kets for sustainably harvested fish — may be 
necessary to support sustainable fisheries on 
coral reefs. 

The authors also report a second compelling 
result. They identify 15 reefs that have much 
greater fish biomass than would be expected 
given the human and environmental pressures. 
These reefs constitute what are called ‘bright 
spots’ (Fig. 1), and can now be analysed to 
learn lessons that may point conservationists 
in new directions. 

Identifying and scaling up the lessons 
learned from individuals or populations that 
are outperforming those around them has 
been done successfully in the fields of human 
health, development (such as agricultural 
practices or resource use) and business man- 
agement, but this strategy is not a standard 
approach in marine-conservation studies. 
The benefits of understanding the factors that 
underlie outperformers — whether in coral- 
reef conservation or human health — are 
manifold and include the potential to generate 
local community involvement by identifying 
proven and practical solutions that can be 


Figure 1 | Bright spots. Cinner et al.° carried out a global analysis of fish biomass on coral reefs and identified locations (such as Raja Ampat, Indonesia, shown 
here) known as bright spots, where fish stocks were higher than expected given local human and environmental pressures. 
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scaled up in other locations. 

Five of the fifteen bright spots identified 
by Cinner and colleagues are in unpopu- 
lated areas. This leaves only ten other sites 
with which to explore how different socio- 
economic conditions might affect decisions 
to fish sustainably when access to markets is 
not limiting. The authors’ preliminary analyses 
suggest that bright spots tend to be associated 
with communities that are actively engaged 
in reef-management decisions, and which are 
highly dependent on the resources, or in com- 
munities that have customs in place to control 
usage. On the basis of these results, strategies 
that promote community engagement and 
local enforcement may be crucial for sustain- 
able management. 

The new results are compelling, but 10 sites 
out of 2,500 reefs is an extremely limited num- 
ber for a robust, quantitative analysis of bright 
spots (although the low number is interesting 
in its own right, suggesting that there are very 
few outliers among the world’s reefs). More 
in-depth socio-economic studies are needed 
to fully understand why these conditions sup- 
port high reef-fish biomass in some locations 
but not in others. 

One limitation of this study is the tight 
focus on the market-based causes of coral-reef 
degradation at a time when coral reefs are 
increasingly threatened by accelerating 


CONDENSED-MATTER PHYSICS 


environmental changes’. The precarious health 
of coral reefs was heart-breakingly illustrated 
in early 2016, when warm ocean tempera- 
tures caused severe bleaching (the expulsion 
of photosynthetic algae from corals, which can 
result in the death of the coral) of reefs glob- 
ally’, including at the iconic Great Barrier Reef". 
In their study, Cinner et al. also find that coral 
reefs that have experienced recent environ- 
mental shocks, such as bleaching or cyclones, 

have lower fish bio- 


“Preliminary mass than would be 
analyses expected on the basis 
suggest that of pressures such as 
bright spots market influences. 
tend to be In addition, the coral 
associated with reefs located near 
communities cooler, deeper waters 
that are actively where animals can 
engaged in reef- hide during unfavour- 
management able conditions had 
decisions.” higher fish biomass. 


Lessons can also 
be learned from the ‘dark spots’ (reefs that 
support fewer fish than would be predicted 
on the basis of human and environmental 
pressures) that were identified by Cinner and 
colleagues. Not surprisingly, several of these 
reefs were recently exposed to environmental 
disturbances, such as cyclones, highlighting the 
importance of considering how climate change 


Attractive electrons 
from nanoengineering 


Electrons repel each other because they are negatively charged. An experiment 
now confirms a fifty- year-old theory that electrons can also attract one another 
as a result of repulsion from other electrons. SEE LETTER P.395 


TAKIS KONTOS 


oulomb’s law describes the force that 

exists between two charged parti- 

cles, such as the mutual repulsion of 
negatively charged electrons, and has been 
confirmed by a multitude of experiments. 
However, electrons in matter can also attract 
each other. On page 395, Hamo et al.’ demon- 
strate an attraction mechanism that is purely 
electronic. They prove that the repulsion of 
electrons in solids can be turned into an attrac- 
tion with the help of other electrons, validat- 
ing a long-standing theory proposed by the 
physicist William Little’. 

The idea that electrons can attract each other 
is central to the theory of how conventional 
superconductors such as aluminium work. In 
the Bardeen—Cooper-Schrieffer (BCS) theory 
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of superconductivity’, electrons attract one 
another through lattice vibrations called pho- 
nons. This mechanism imposes constraints 
— for example, the characteristic energy of 
phonons, the Debye frequency, governs the 
temperature below which a material can super- 
conduct (the critical temperature). 

Soon after the advent of the BCS theory, 
Little proposed a different ‘excitonic attraction 
scheme, whereby electrons attract each other 
because of repulsion from other electrons, 
in close analogy with the attraction through 
phonons” (Fig. 1a). To illustrate the excitonic 
mechanism, he considered two electrons 
(the system) interacting with a polarizable 
molecule (the polarizer; let’s consider a dia- 
tomic molecule for simplicity). The polarizer 
has one electron that is free to move between 
the molecule’s two atoms. An electron in the 
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will continue to affect reefs in the future. 

Although analysis of bright spots may 
help to identify market-based management 
approaches for sustaining coral-reef fishes, a 
portfolio of conservation approaches is prob- 
ably still needed to address the numerous 
threats to coral-reef ecosystems, including cli- 
mate change. Without sustainable fisheries, we 
will not be able to maintain healthy coral reefs. 
And without healthy coral reefs, we will not be 
able to sustain the fisheries needed to feed the 
world’s growing population. = 
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system can repel the polarizer electron, leav- 
ing behind a positively charged region. The 
other electron in the system is then attracted to 
this region and, consequently, the two system 
electrons are attracted to one another. 

The characteristic energy of the excitonic 
mechanism is simply the energy difference 
between two electronic states of the polar- 
izer (the bonding and anti-bonding states)’, 
which can, in principle, be much larger than 
the Debye frequency of most metals. The 
excitonic mechanism was therefore expected 
to provide a higher critical temperature than 
the few kelvin that was, at the time, thought 
to be required by its phononic counterpart. 
Such high-temperature superconductivity is 
desirable for practical applications, and the 
possibility of achieving it triggered intense 
theoretical and experimental activity. It was 
thought that the excitonic mechanism could 
be responsible for the superconductivity of 
organic superconductors; however, after years 
of intense debate, it is now generally accepted 
that it is not’. 

Efforts to find a superconductor whose 
activity is based on the excitonic mechanism 
face two key problems: a material must be 
discovered in which Little’s proposal can be 
applied, and the excitonic mechanism in that 
material must lead to superconductivity’. 
Hamo et al. decided to separate the two prob- 
lems. They aimed to demonstrate the basic 
attractive mechanism in a material without 
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Figure 1 | The excitonic mechanism for attractive electrons. a, In 1964, the physicist William Little 
proposed a set-up to demonstrate his theory that electrons can attract one another because of repulsion 
from other electrons’. There are two key components to his proposal: a pair of electrons (blue) within an 
electron chain (the ‘system’), and a polarizable molecule (the ‘polarizer’), composed here of two atoms 
(red). b, Hamo et al.' use nanoengineering to realize Little’s set-up. In their experiment, the system 
consists of two electrons within a carbon nanotube and the polarizer is a pair of potential-energy wells 
(orange) within a separate carbon nanotube. The two nanotubes are placed perpendicularly to each other 
such that one of the polarizer’s potential wells is directly above the system. As the separation between the 
two nanotubes is reduced, the presence of the polarizer causes the system electrons to attract each other. 


requiring superconductivity. Furthermore, 
instead of trying to find the desired material 
in nature, the authors fabricated it using nano- 
metre-scale engineering techniques. 

Unlike Little's original proposal, the authors’ 
polarizer is an artificial molecule: a pair of 
potential-energy wells within a carbon nano- 
tube, which is placed on a microchip. The 
electron system is constructed inside a sepa- 
rate carbon nanotube, on its own microchip. 
The two nanotubes are then placed perpen- 
dicularly to each other (Fig. 1b), one above 
the other, inside a scanning probe micro- 
scope. After the apparatus is cooled to a low 
temperature (about 10 millikelvin), the nano- 
tubes are moved closer together until they are 
about 100 nanometres apart. The authors can 
then observe the effect of the polarizer on the 
behaviour of the system electrons and, in par- 
ticular, confirm whether or not the electrons 
attract one another. 

Hamo and collaborators’ experiment has 
several key assets. Their nanotube material is 
almost entirely free from impurities, which 
allows the electrical force acting on the elec- 
trons inside each nanotube to be carefully 
controlled. The particular set-up used by 
the authors also allows them to accurately 
assemble or disassemble the basic elements of 
Little’s proposal — the system and the 
polarizer — and study their formation and 
interactions. 

In contrast to work with real molecules, the 
researchers’ electrical-current measurements 
and charge-sensing of artificial molecules 
allow direct characterization of all of the rel- 
evant energy scales in the experiment. For 
example, the interaction between the system 


electrons can be determined by measuring 
the signal from electrometers built inside the 
carbon nanotubes, or simply from the current 
flowing through the devices as a function of 
the applied voltages. Using these techniques, 
the authors concluded that the electrons 
were attractive, confirming Little’s excitonic 
mechanism. 

In light of Hamo and colleagues’ findings, 
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one may now dream of engineering new 
types of superconductor. However, making a 
whole crystal — or even a short chain — of the 
authors’ material would be a formidable task. 
Furthermore, because the excitonic mecha- 
nism seems to be optimal in fewer than three 
spatial dimensions, superconductivity will 
inevitably compete with other phenomena 
that affect electrical behaviour (such as insu- 
lating charge-density waves)". In addition, the 
bonding-—antibonding splitting produced in 
the authors’ set-up requires low temperatures 
(in the sub-kelvin range). 

Nevertheless, the purely electronic attraction 
demonstrated is so generic that it could be 
transposed to any type of material in which 
some, or possibly all, of these constraints could 
be lifted. Although Hamo and collaborators’ 
set-up will not form the basis of a practical 
excitonic superconductor, it will be useful as 
a ‘quantum simulator’ of such superconduc- 
tors for basic research. It is therefore likely to 
become a key feature of the quantum toolbox 
of nanoscience. m 
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Nocturnal escape route 
for marsh gas 


A field study of methane emissions from wetlands reveals that more of the gas 
escapes through diffusive processes than was thought, mostly at night. Because 
methane is a greenhouse gas, the findings have implications for global warming. 


KATEY WALTER ANTHONY & 
SALLY MACINTYRE 


lobal wetlands are the largest 

natural source of atmospheric 

methane, a potent greenhouse gas. 
Methane is produced by microbes decompos- 
ing organic matter in oxygen-free sediment, 
and escapes from wetlands by three main 
pathways: through plants; by bubbling out of 
water (ebullition); and by diffusion of methane 
molecules across the microscopically thin air- 
water interface (Fig. 1). Most of the methane 
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emissions from lakes and wetlands have been 
credited to the first two pathways. But writing 
in Geophysical Research Letters, Poindexter et 
al.' report evidence indicating that diffusive 
emissions are larger than previously thought, 
and occur mostly at night. 

Diffusive emission of sparingly soluble 
gases such as methane from wetlands is 
controlled by passage across the air—water 
interface. Field data show that the emission 
can be quantified using a model in which the 
near-surface water is renewed by upwelling 
motions, for instance, those caused by 
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Figure 1 | Methane-transport pathways from wetland soils. Methane is generated in oxygen-free sediments in wetlands, and can escape to the atmosphere by 
bubbling to the surface (ebullition), by transport through plants (which is highest during the day), or by diffusion of molecules across the air—water interface. Heat 
loss from wetlands at night can cause convection that generates turbulent eddies, which renew dissolved methane just below the air—water interface, accelerating 
diffusive emissions. The larger eddies transport methane throughout the water column, whereas the smaller ones thin the diffusive layer immediately below the 
air—water interface. The size of the smaller eddies (500 micrometres to 1 centimetre across) and the rate of methane renewal varies with the intensity of turbulence. 
Poindexter et al.’ find that diffusive methane emissions from wetlands are much greater than was previously thought. Figure adapted from ref. 13. 


turbulent eddies, associated with wind or 
cooling”’. Methane transport across the 
air—water interface accelerates as turbulence 
intensifies, for example as the wind increases. 
Poindexter et al. are the first to quantify how 
heat loss from wetland surfaces accelerates 
diffusive emissions over seasonal and annual 
cycles, using a heat-flux-based equation that 
is similar to the surface-renewal model”. 
On clear, still nights, heat is emitted from the 
water surface by radiation; additional cooling 
from evaporation and conduction may occur 
even with light winds. Cooling surface water 
sinks, displacing the underlying water and 
forming eddies. Large eddies circulate dis- 
solved methane throughout the water column, 
whereas smaller, near-surface eddies renew the 
methane at the air—water interface, increasing 
methane flux to the atmosphere. Poindexter 
et al. use the term ‘hydrodynamic transport’ 
to describe this convective mixing driven by 
cooling; however, hydrodynamic transport is 
an umbrella term that usually pertains to mul- 
tiple processes that transport dissolved gases 
through water, including convection, internal 
waves (which occur within the body of the 
water), and the breaking of surface waves. 
Past field measurements (see refs 5 and 6, 
for example) of wetland methane emissions 
led scientists to conclude that diffusive trans- 
port was the least important of the processes 
that release methane to the atmosphere, but 
most of those measurements were made 
during daylight hours, when sunlight warms 
wetland surface water and creates stable density 
gradients that retard convective mixing. Using 
sensors mounted several metres high on a 
tower, Poindexter et al. took continuous, 
year-round measurements of wind speed, 
energy inputs and outputs to and from the 
water (including heat flux), total methane 
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flux and temperature from a shallow wetland 
dominated by tall emergent plants in the Sacra- 
mento-San Joaquin Delta, a temperate region 
in California. Day- and night-time meas- 
urements of the concentration of methane 
dissolved in surface water were also made on 
24 days distributed throughout the year. 

The authors calculated that convection- 
enhanced diffusion caused 18% of total day- 
time emissions on average. But when they 
incorporated the effect of nocturnal cooling, 
they found that convection-enhanced diffu- 
sion was the dominant transport pathway at 
night, contributing 54% of total night-time 
fluxes. Moreover, because the authors make 
several assumptions about the nature of flow 
near the air—water interface, their estimates 
are conservative. This discrepancy points to 
the need for more effort to quantify the effect 
of surfactants and near-surface turbulence on 
parameters used in the authors’ calculations’. 

Poindexter and colleagues’ results do 
not imply that total wetland emissions are 
higher than previously thought. Rather, they 
require a reassessment of how emissions are 
partitioned among the three methane-escape 
pathways in large-scale regional and global 
models of wetland emissions” ”’. A larger con- 
vection-enhanced diffusive flux implies lower 
emissions by plants and/or ebullition. The 
partitioning will be system specific, because 
several factors — including plant type and 
coverage, water depth and landscape geo- 
morphology — work together to control the 
relative contribution from each pathway. 

The demonstration that convection- 
enhanced diffusion is a larger source of 
atmospheric methane from wetlands than 
was previously thought is valuable for sev- 
eral reasons. First, it suggests that nocturnal 
cooling — which is not well accounted for in 
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commonly used wind-based formulations of 
diffusive emissions — is probably a signifi- 
cant factor in other sheltered systems such as 
thermokarst (thaw) ponds and flooded for- 
ests. Second, it has implications for tropical 
wetlands, which are particularly poised for 
strong convection-enhanced diffusive emis- 
sions because heat losses associated with 
evaporation and with radiation emitted from 
the water surface are enhanced when surface 
waters are warm. Methane concentrations in 
tropical wetlands are high and do not exhibit a 
strong winter decline as in temperate wetlands, 
which means that the fraction of convection- 
enhanced emissions from tropical ecosystems 
could be higher than those reported in the 
current study from a temperate wetland. 

A third implication of the work stems from 
the lack of day-to-night differences in surface- 
water methane concentrations in the shallow 
California wetland, which implies that there 
was no strong vertical gradient in methane 
concentration. Pronounced concentration 
gradients occur beneath the surface layer in 
many other water bodies, such that the lower 
part of the water column is methane-rich. Mix- 
ing from convection will entrain methane from 
the lower water column in those bodies, bring- 
ing it to the surface where it can escape to the 
atmo sphere’’. In addition, convection reduces 
the time that dissolved methane spends in water 
and thus reduces its consumption by microbes; 
this shorter residence time has the potential to 
further increase methane fluxes. 

Convection-enhanced methane emissions 
are probably largest in warm climatic regions, 
but most small, methane-rich water bodies are 
located in cooler locations at high northern 
latitudes’. However, these cooler areas are pro- 
jected to undergo temperature increases of up to 
7.5°C by the end of this century”. This warming 


will lead to higher methane production and 
to convection-enhanced diffusion becom- 
ing an increasingly significant mechanism for 
releasing this methane to the atmosphere, con- 
stituting a newly recognized positive feedback to 
climate warming. In the meantime, Poindexter 
and colleagues’ results underscore the need 
to consider wetlands and lakes as dynamic 
systems, even on clear, calm nights. m 
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heterogeneity revisited 


Two analyses of insulin- producing f-cells reveal differences in what has long 
been considered a homogeneous population. These differences might reflect 
changes during maturation or ageing, or distinct cell lineages. SEE LETTER P.430 


SUSAN BONNER-WEIR & 
CRISTINA AGUAYO-MAZZUCATO 


iabetes is associated with an inadequate 
D supply of functional pancreatic B-cells, 

which normally maintain healthy 
blood-glucose levels by secreting insulin 
in response to raised levels and shutting off 
secretion when glucose is scarce. These cells 
have been regarded as a single, homogeneous 
population, even though studies have shown 
that B-cells are functionally immature in the 
neonatal period immediately after birth’, and 
that adult B-cells are not all identical**. Now, 
two papers (Bader et al.° on page 430 and 
Dorrell et al.° in Nature Communications), 
identify genetic markers that allow subpopu- 
lations of B-cells to be isolated and studied. 
These studies are bound to revive interest 
in B-cell heterogeneity and highlight the 
relevance of this research area for understand- 
ing and enhancing cell-replacement therapies 
for diabetes. 

Although all adult B-cells may seem the 
same when analysed using stains for insulin, 
subpopulations can differ in terms of insulin 
secretion” and insulin-expression levels’, 


length of chromosome end structures called 
telomeres® (which correlates with the number 
of times the cell has divided) and expression 
of the gene p16Ink4a, a marker and effector 
of B-cell growth arrest (senescence)’. Bolster- 
ing the idea that this heterogeneity is func- 
tional are differences in B-cell sensitivity to 
glucose’® and the fact that the cells can be 
recruited by increased glucose levels to adopt 
biosynthetic’ or active secretory states’, 
adapting to the demand for more insulin 
secretion. 

Pancreatic islets are micro-organs composed 
of several hormone-secreting cell types, 
including B-cells. During the development of 
islets, cells are organized into a 3D architecture 
through a mechanism called planar polarity. 
Bader et al. investigated B-cell expression of 
a planar-polarity protein called Flattop (Fltp) 
in mice. 

The authors replaced the gene encoding 
Fltp with a gene that encodes a fluorescent 
reporter protein dubbed FVR, so that cells 
that normally express Fltp fluoresced. In 
this way, they defined two B-cell subpopula- 
tions — proliferative immature cells that did 
not express Fltp or fluoresce (FVR>), and 
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50 Years Ago 


It is not surprising that the Prime 
Ministers of France and Britain 
should have been grateful for a few 
crumbs of agreement to fill out the 
communique after a recent meeting 
in London. By all accounts, British 
membership of the European 
Economic Community was not 

a fruitful topic of conversation. It 
is therefore paradoxical that the 
two Prime Ministers should have 
considered this a suitable time 

for a decision in principle that 

the tunnel beneath the English 
Channel should in due course be 
built... The case for a tunnel is 
wearing thin. Alternative ways of 
crossing what is, after all, a narrow 
strip of water have become more 
attractive ... It looks very much 

as if the two governments may be 
making the right decision at the 
wrong time... it is to be hoped that 
the two governments will have 
another close look at the wisdom 
of the course to which they have 
committed themselves before the 
digging ofa tunnel actually begins. 
From Nature 23 July 1966 


100 Years Ago 


Like ourselves, the industrial cities 
of the United States are beginning 
to realise the serious economic 

and hygienic effects caused by the 
unscientific combustion of coal... 
Dr W.E M. Goss has contributed 
to a paper on “Smoke as a Source 
of Atmospheric Pollution” ... The 
author is not very optimistic in 

his outlook, for he considers that 

a revolution in practice which will 
result in the elimination of existing 
sources of atmospheric pollution is 
not to be expected “because present- 
day knowledge is insufficient to 
supply necessary means”... We 

are throwing away ...a valuable 
inheritance which should belong to 
coming generations, and which they 
will never be able to recover. 

From Nature 20 July 1916 
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Figure 1 | Separating B-cell subtypes. The insulin-secreting B-cells of the pancreas are thought to renew 
slowly, so, at any one time, cells of different ages will be present in the tissue. Immature cells are highly 
proliferative and do not respond robustly to glucose stimulation, but secrete some insulin even at basal 
(low) blood-glucose levels. Mature cells proliferate in response to external stimuli and release insulin 
robustly, secreting little or no insulin under basal conditions. Old cells cease to proliferate (they become 
senescent) and may be dysfunctional. Bader et al.” show that immature cells do not express the protein 
Flattop (Fltp), but mature cells do. Dorrell et al.° find that subsets of B-cells express the proteins ST8SIA1 
and/or CD9. Although it was not tested, it could be hypothesized that these cells might be senescent, 
because they show impaired glucose-stimulated insulin secretion compared with the B-cells from the 


same samples that lack these markers. 


mature, quiescent fluorescing cells (FVR’). 
Eighty per cent of B-cells were FVR* in adult 
mice, as were about 50% of other islet cells. It is 
of considerable interest that 20% of adult cells 
are FVR , because it has been reported*”” that 
20% of B-cells in adult rodents are unrespon- 
sive to glucose. 

The authors next separated and further 
analysed each subpopulation for gene expres- 
sion, ultrastructure and insulin secretion. 
These experiments supported the idea that Fltp 
isa marker of mature B-cells. Furthermore, an 
in vitro tracing analysis confirmed that B-cells 
progress from FVR’ to FVR' over time. FVR™ 
cells proliferated more than FVR’ cells during 
the normal B-cell population expansion seen 
during the neonatal period and pregnancy. By 
contrast, when mice were fed a high-fat diet, 
which increases the demand on B-cells, cells in 
the FVR* subpopulation increased in size, thus 
presumably increasing their insulin-secreting 
capacity. 

Fltp seems to be a marker, rather than an 
effector, of the B-cell transition from immature 
and proliferative to mature and quiescent — 
deletion of Fltp in mice did not impair B-cell 
development, proliferation or maturation. 
Adult mice that lacked the protein had nor- 
mal glucose tolerance, insulin sensitivity and 
levels of pancreatic insulin and glucagon, a 
hormone that raises blood glucose. However, 
Bader et al. found that activation of a signal- 
ling cascade called the Wnt/planar cell-polarity 
pathway, of which Fltp is a downstream effec- 
tor, enhanced human and mouse B-cell matu- 
ration in vitro. Moreover, manipulating cells 
into 3D aggregates had the same effect, indicat- 
ing that planar polarity is important for B-cell 
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maturation. In the future, the development of 
specific antibodies that bind Fltp will facilitate 
a range of experiments to study its role in the 
human pancreas. 

In the second study, Dorrell et al. provide 
further evidence for the existence of B-cell 
heterogeneity, this time in humans. Previ- 
ously’®, the same group used antibodies that 
bind cell-surface proteins to identify hormone- 
secreting cell types in partially purified isolated 
human islets. In the current paper, they report 
the development of two more antibodies 
that allowed them to distinguish four sub- 
types of B-cell in the adult human pancreas. 
The antibodies bind molecules whose role 
in B-cells is unknown — CD9, a cell-surface 
glycoprotein that in other systems suppresses 
cell proliferation, and ST8SIA1, a membrane- 
bound glycosphingolipid involved in cell 
adhesion and cell growth. 

The authors found consistent proportions 
of the four B-cell subtypes in healthy adults. 
Expression of genes crucial for B-cell identity 
was similar in each subgroup, but expres- 
sion of around 100 other genes of mostly 
unknown function in B-cells differed. Cells 
that expressed both CD9 and ST8SIA1 
released more insulin in basal conditions and 
were less responsive to glucose stimulation 
than those negative for both markers. This 
impaired responsiveness may be relevant for 
disease, because B-cell function is impaired 
in type 2 diabetes, and Dorrell and colleagues 
showed that the two ST8SIA 1-expressing sub- 
types accounted for 45% of B-cells in people 
with this disease, compared with 18% in 
healthy adults. 

What is the biological reason for B-cell 
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heterogeneity? We have hypothesized” that 
young B-cells differ from those in the early 
stages of maturation or in midlife, those that 
are mature and have stopped dividing, and 
those that are old or dying. If this is true, the 
observed subtypes might reflect different 
stages of the B-cell life cycle (Fig. 1). 

Although Bader et al. showed that mouse 
cells transition from the Fltp-lacking to the 
Fltp-expressing subgroup over time, it is 
unclear whether the four human £-cell sub- 
types observed by Dorrell and colleagues have 
a temporal relationship or are independent 
lineages. It also remains unclear whether the 
increased proportion of ST8SIA1-expressing 
subtypes in people with type 2 diabetes is the 
result of selective loss of functionally active 
B-cells that lack ST8SIA1, or arises because 
of progressive dysfunction of B-cells owing 
to the cellular stress of diabetes. Regardless of 
whether the four human subtypes are separate 
lineages or sequential stages, their heterogene- 
ity should be considered in research that aims 
to generate optimal cells for B-cell replacement 
therapy, because these subgroups may differ in 
their proliferative capacity or vulnerability to 
cellular stress. 

Together, these papers revisit and underscore 
the need to consider B-cells as a heterogeneous 
population comprising cells at different stages 
of development with different functional and 
proliferative characteristics. Thanks to the 
current studies, the tools to explore these 
differences further are now available. m 
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A comprehensive transcriptional map of 
primate brain development 
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The transcriptional underpinnings of brain development remain poorly understood, particularly in humans and closely 
related non-human primates. We describe a high-resolution transcriptional atlas of rhesus monkey (Macaca mulatta) 
brain development that combines dense temporal sampling of prenatal and postnatal periods with fine anatomical 
division of cortical and subcortical regions associated with human neuropsychiatric disease. Gene expression changes 
more rapidly before birth, both in progenitor cells and maturing neurons. Cortical layers and areas acquire adult-like 
molecular profiles surprisingly late in postnatal development. Disparate cell populations exhibit distinct developmental 
timing of gene expression, but also unexpected synchrony of processes underlying neural circuit construction including 
cell projection and adhesion. Candidate risk genes for neurodevelopmental disorders including primary microcephaly, 
autism spectrum disorder, intellectual disability, and schizophrenia show disease-specific spatiotemporal enrichment 
within developing neocortex. Human developmental expression trajectories are more similar to monkey than rodent, 
although approximately 9°% of genes show human-specific regulation with evidence for prolonged maturation or neoteny 


compared to monkey. 


The primate brain develops through a series of stereotyped processes 
that are conserved across mammals', including the specification, 
migration, and maturation of diverse cell types and the formation and 
refinement of functional neuronal circuits. There are also primate- 
specific features of brain development that increase anatomical’, 
cognitive*, and behavioural complexity’, and may explain why many 
human neurological and neuropsychiatric diseases are inadequately 
modelled in rodents’. These features include a larger cortical progen- 
itor pool in the outer subventricular zone not present in rodents” and 
protracted myelination®, synapse production’, and pruning®. Rhesus 
monkeys and humans share a greatly expanded neocortex and special- 
ization of areas (most notably primary visual cortex”), compared to 
mouse brains, reflecting the more recent common ancestor of humans 
and rhesus monkeys (25 million years ago) than humans and mice 
(70 million years ago)’. Likewise, the rhesus monkey and human 
brain’ share more similar patterns of gene expression than do mouse 
and human brains!!. The rhesus monkey thus provides a valuable proxy 
for human brain development, particularly during prenatal and early 
postnatal development stages that are difficult to sample in humans, 


and also provides a comparator to study human-specific features such 
as prolonged maturation or neoteny®!”"?. 

Molecular programs responsible for brain development remain 
incompletely understood in any species, due to the spatiotemporal 
complexity of these processes and the resource intensity of methods 
needed to probe them. Transcriptome-based approaches have accel- 
erated understanding of variation in gene expression programs related 
to brain structure and function in adult and developing humans!*"!7 
and rhesus monkeys!™!8, albeit with limited anatomical resolution and 
developmental coverage. Recent studies in adult mice!*”° and humans”! 
have profiled individual cortical cells and demonstrated robust tran- 
scriptional differences between neuronal and non-neuronal cell types 
and large-scale changes over the course of development. Although 
these approaches offer cellular resolution in targeted domains, a global 
picture of gene expression changes over development across the com- 
plete cellular milieu has been lacking. 

This project had the aim of creating a transcriptional atlas of non- 
human primate brain, combining fine anatomical precision with 
dense temporal coverage across prenatal and postnatal development 
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in regions associated with neuropsychiatric disease. These transcrip- 
tional data are complemented by imaging and histology-based ana- 
tomical reference and cellular-resolution in situ hybridization (ISH) 
data sets, all publicly accessible through the NIH Blueprint Non- 
Human Primate (NHP) Atlas (http://www.blueprintnhpatlas.org and 
http://www.brain-map.org). 


Spatiotemporal transcriptome analysis 

This resource characterizes rhesus monkey forebrain development 
with three modalities: (1) anatomical reference data sets consisting of 
magnetic resonance imaging (MRI) and corresponding densely sam- 
pled Nissl stains; (2) cellular-resolution gene expression data gener- 
ated with a high throughput ISH platform; and (3) fine anatomical 
resolution transcriptome time series data generated with a combina- 
tion of laser microdissection (LMD) and DNA microarrays (Fig. 1a). 
Molecular analyses focused on discrete progenitor and postmitotic cell 
populations in neocortex (medial prefrontal and visual cortical areas), 
hippocampus, striatum, and amygdala (Fig. 1b and Supplementary 
Tables 1 and 2). Ten developmental time points (the ten ‘ages’ referred 
to hereafter) were chosen to correspond to peak periods of neurogen- 
esis for neurons destined for different layers and glial cell types of V1 
(prenatal) and major developmental epochs: neonate, infant, juvenile, 
and young adult (Fig. 1c, postnatal). Anatomical parcellation at each 
time point was based on prior work in monkey and human brain 
development*’” (Extended Data Fig. 1). Canonical genes marking 
different neural cell types showed expected spatiotemporal patterning 
across development, shown as heat maps (Fig. 1d and Extended Data 
Fig. 2) laid out following the anatomical organization in Fig. Ic. 

To gain a high-level understanding of transcriptional dynamics 
during cortical development, we represented the similarity between 
cortical samples using multidimensional scaling (MDS) based on cor- 
related expression (Fig. le-g). Layer and age explained almost 90% of 
the variation across all samples (n = 922; Fig. le) and prenatal samples 
(n= 542; Fig. 1f). To examine smaller differences between neuronal 
populations (grey circles in Fig. 1f), we repeated the MDS analysis 
only on V1 cortical layers (containing different types of excitatory 
neurons; n = 175; Fig. 1g). Transcriptional similarity of layers reflects 
spatial proximity and birth date, as described in adult rhesus monkey”. 
Moreover, continuous variation across prenatal development suggests 
gradual changes in gene expression. 

There were few sex-related differences in expression, although the 
study was not powered to detect subtle differences restricted to spe- 
cific brain regions or developmental stages. Only one autosomal gene 
(LOC693361, NADH-ubiquinone oxidoreductase chain 1-like) showed 
robust increased expression in males across brain regions during pre- 
natal development (Extended Data Fig. 3 and Supplementary Table 3). 
However, the microarray probe for LOC693361 also targets an unanno- 
tated region of rhesus monkey Y chromosome”, so there are probably 
no autosomal genes with persistent male- or female-specific expression 
differences across development. 


Developmental transcriptional dynamics 

Previous work has demonstrated that different cortical layers have 
distinct transcriptomic profiles both in adult and developing cortex, 
where the largest differences are seen between germinal and post- 
mitotic cell populations!”. In grossly dissected developing human 
prefrontal cortex, gene expression changes fastest prenatally and 
slows sharply just after birth'*. If dividing cells drive most expression 
change, then depletion of the progenitor pool would explain decreased 
rates. In contrast, we find that all layers of V1 and prefrontal cortex 
(anterior cingulate gyrus; ACG) have similar rates of expression 
change across development (Fig. 2a). Thus transcriptional programs 
change dramatically both in progenitor cell populations as they give 
rise to different neuronal and non-neuronal cell types, as well as in 
postmitotic neuronal cell types as they differentiate and mature into 
adulthood. 
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Similar developmental trajectories were seen in hippocampus, basal 
ganglia and amygdala (Extended Data Fig. 4a). However, the genes 
changing dynamically could differ by region or age. We identified the 
top 1,000 genes with the largest increase and decrease in relative expres- 
sion between all pairs of ages for each structure and visualized the over- 
lap of these gene lists as a heat map (Fig. 2b). This threshold was selected 
because at least 1,000 genes significantly changed expression (ANOVA 
false discovery rate (FDR) <0.05, fold change >1.25) between all adja- 
cent ages in the majority of regions (Extended Data Fig. 4b). Genes 
changing over time were remarkably synchronized across different cell 
populations. At every age, approximately half of the top increasing and 
decreasing genes were shared by most brain regions, and unsupervised 
clustering grouped samples almost perfectly by age (age index bar in 
Fig. 2b). Genes increasing with age showed substantial overlap across 
ages within, but much less between, prenatal and postnatal ages. This 
suggests that distinct transcriptional programs are active in prenatal 
and postnatal development, and that many of these programs progress 
gradually. At early stages, samples grouped by proliferative state inde- 
pendently of brain region (mitotic index bar in Fig. 2b) reflecting a 
common set of cell division genes. Postnatally, samples grouped more 
by anatomical structure, reflecting increased regional identity during 
brain maturation (structure index bar in Fig. 2b). 

To interpret these dynamic gene expression patterns, we searched 
for enrichment of Gene Ontology (GO) terms at each region and 
age (Supplementary Tables 4 and 5) and represented significant 
enrichments as heat maps ordered as above (Fig. 2c). For example, 
genes related to onset (positive regulation) of axonal projection were 
enriched during prenatal periods as expected. Surprisingly, this process 
appears to be actively repressed during late postnatal development, 
as genes associated with offset (negative regulation) of axonal pro- 
jection showed increased expression from juvenile (12 months) to 
young adult (48 month) stages. Processes associated with synapse 
development are temporally enriched in late prenatal and early post- 
natal periods, and also synchronized between presynaptic (synaptic 
vesicle localization) and postsynaptic (dendrite development) neurons. 
Interestingly, the region-specific blocks of coordinated expression in 
postnatal visual cortex (V1 and V2) and hippocampus (Fig. 2b, arrows) 
were enriched for autophagy genes (Extended Data Fig. 4c), which 
may reflect early dendritic spine pruning” and grey matter volume 
reduction” in visual cortex. 


Regional timing of biological processes 
Some biological processes, such as synaptogenesis and myelination, are 
protracted over development“, and we find that process activation, 
inferred from increased expression of process-related genes at each 
age (‘increasing’), is longer than inactivation (‘decreasing’; Fig. 3a). 
Biological processes persist longer in all regions pooled together (black 
lines) than in any individual region (coloured lines), so developmental 
timing must vary between regions. We quantified synchrony between 
pairs of regions as the proportion of ages during which a process is 
active in both versus either region, for processes active in all regions 
at some age (for example, Extended Data Fig. 5a). Remarkably, despite 
major differences in cellular and functional makeup of brain regions 
sampled, all pairs of regions showed similar degrees of synchrony across 
all GO terms. This synchrony was centred on overlap of approximately 
half of all ages (Extended Data Fig. 5b), far greater than expected by 
chance. Few processes—cell proliferation, projection, and adhesion— 
were synchronized across all regions and ages (Supplementary Table 5). 
Processes usually started before birth in all regions, but sometimes at 
strikingly different times in different regions (Fig. 3b). Consistent with 
known early maturation of subcortical circuits', rank ordering age of 
onset for each process across regions showed earlier subcortical onset, 
particularly in the amygdala, and similar timing for neocortex and hip- 
pocampus (Extended Data Fig. 5c). For example, subcortical globus 
pallidus is myelinated before cortical white matter*® and shows early 
increased expression of genes specific to myelinating oligodendrocytes 
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Figure 1 | High-resolution transcriptional profiling of rhesus monkey 
brain development. a, Neuroimaging, histological and transcriptome 
data components. b, Brain regions analysed. c, Primary visual cortex 
(V1) sampling paradigm for transcriptome analysis spanning ages 
from 40 embryonic days (E40) to 48 months (mo) after birth, with 
salient developmental events or stages noted (top row). Whole brain or 
hemisphere sections are shown for each age (V1, red box outline), as well 
as a high magnification Nissl image detailing laminar microdissections. 
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Cortical layers profiled at each stage are colour-coded by predominant 
mitotic (red, pink) or postmitotic (orange, yellow, tan) cell makeup. 

d, Heat maps of canonical cell type marker gene expression organized 
anatomically as in c. e-g, Multi-dimensional scaling (MDS) plots showing 
the first two principal axes of variation for all cortical samples (n = 922) 
(e), prenatal samples (n = 542) (f) and postmitotic layers in V1 (n = 175) 
(g). Scale bars in c: 1cm (top and middle rows) or 1 mm (bottom row). 
Structure abbreviations are given in Supplementary Table 1. 
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Figure 2 | Transcriptional dynamics across brain regions and ages. 

a, Similar decreasing rates of gene expression change across cortical layers 
(rows, ordered from basal to apical) and areas (anterior cingulate gyrus, 
ACG, red; primary visual cortex, V1, blue). b, Overlap of temporally 
dynamic genes across brain regions and ages. Proportion of the top 1,000 
genes with increasing (red, upper triangle) or decreasing (blue, lower 
triangle) expression for each region between pairs of adjacent ages 


(MOG, MOBP, and ERMN) (Fig. 3c and Extended Data Fig. 5d). 
Interestingly, three other oligodendrocyte markers (MAL, ASPA, and 
OPALIN) turned on synchronously in all regions, suggesting a temporal 
and regional dissociation of different stages of myelination. 


Late emergence of mature cortical identity 

The neocortex is composed of diverse neuronal subtypes organized 
into layers that have distinct connectivity and transcriptional signa- 
tures in the adult!®?°”’ with graded and discrete variation between 
cortical areas!>'®8, A neuron develops its mature molecular identity 
through cell-autonomous processes and environmental interactions 
during migration, process extension, and synapse formation. If early 
generated neurons closely resemble their mature states, then environ- 
mental interactions must have small effects on their terminal molecular 
identities. As we found extensive expression changes within postmitotic 
cortical layers (see Fig. 2a), there is a role for either or both of these 
mechanisms. 

To examine the emergence of mature laminar signatures, we com- 
pared layer-enriched expression patterns across development from the 
earliest stage at which all postmitotic layers are present (embryonic 
day 80 (E80)) to adulthood. We calculated the average expression cor- 
relation, across layers and between all pairs of ages, of the top 2.5% of 
genes most specific for each layer in V1 at each age (see Methods and 
Supplementary Table 6). Adult identity emerged gradually (Fig. 4a), as 
evidenced by decreased expression similarity with increased age sepa- 
ration. In fact, laminar expression patterns in young adulthood barely 
resembled mid-fetal cortex (R < 0.1). 

These correlation trends result from nearly complete shifts in genes 
showing robust layer-specific expression (fold change >1.5; Fig. 4b). 
Most genes were enriched only for a subset of contiguous ages at early, 
middle, and late stages, while few genes showed persistent laminar 
enrichment (Fig. 4b, c and Supplementary Table 7). Sets of laminar 
genes tile smoothly across development, suggesting a graded emergence 
of functional characteristics of neuronal subtypes, as well as shifting 
proportions of subtypes within layers. Genes with early laminar expres- 
sion are enriched for GO terms related to early developmental events 
such as neuron differentiation (P= 1.6 x 107’) and axon generation 
(P=6.7 x 10°), whereas later laminar genes are enriched for axon 
guidance (P=6.5 x 10°) and innervation (P=9.8 x 10~°). 

Different anatomical and functional neocortical areas have dis- 
tinct gene expression patterns in the adult, most prominently in V1 
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(for example, E50 compared to E40) that are shared with other ages and 
regions. Note large overlap between all regions at each age and within 
prenatal and postnatal periods. Arrows highlight region-specific blocks of 
coordinated expression. AM, amygdala; BG, basal ganglia; HP, hippocampus; 
FCx, frontal; OCx, occipital; PCx, parietal cortex; TCx, temporal cortex; 
THAL, thalamus. c, Timing of neuronal connectivity-related events revealed 
by enrichment of GO terms in overlapping gene lists from b. 

(refs 15, 16, 28). During development, intrinsic (protomap)”? and 
extrinsic (protocortex)*? factors shape cortical area identity. To ask 
whether the temporal dynamics of areal expression support intrinsic 
versus extrinsic areal specification, we first identified genes differ- 
entially expressed between caudal (V1, blue) and rostral (ACG, red) 
cortex in different layers at each developmental stage (Fig. 4d and 
Supplementary Table 8). More genes were differentially expressed 
between cortical areas in postmitotic layers during late postnatal than 
during prenatal development. These genes were enriched for cell adhe- 
sion (P=3.9 x 10~°) and synaptic transmission (P=7.3 x 10~*) GO 
terms, suggesting a role for extrinsic factors related to the functional 
maturation of cortical circuitry. Developmental process timing varies 
across primate cortex, with neurogenesis occurring earlier in frontal 
than caudal cortex, but circuit maturation persisting later in frontal 
cortex®®, and this was recapitulated by an even more protracted emer- 
gence of enriched genes in ACG (12 to 48 months) compared to V1 
(3 to 12 months). 

To test for genes that may contribute to intrinsic specification, we 
searched for genes with persistent areal enrichment beginning before 
the arrival of thalamic*! or other extrinsic inputs. Although 38% of genes 
showed regional differences, only 1% were persistent over development 
(Supplementary Table 8). These genes were primarily expressed in post- 
mitotic cortical layers, for example CBLN2 (ref. 17), and only 20 genes 
(0.2%) were expressed in germinal zones, including the well-known 
caudal-to-rostral gradient genes FGFR3 (ref. 17, 29) and NR2F1 (also 
known as COUP-TFI)”’. Thus, although there is evidence for a small 
number of genes clearly reflecting early intrinsic areal specification, 
there are many more adult differences that could be due to extrinsic 
interactions and activity-dependent mechanisms”. 

Some areal differences are probably due to differences in the pro- 
portion and maturation state of specific cell types, such as dividing 
neural progenitors and postmitotic neurons and glia. We identified 
spatiotemporal locations with significant enrichment in V1 or ACG 
for markers of cell cycle (that is, actively dividing progenitor cells), 
GABAergic and glutamatergic neurons in adult mouse V1, and astro- 
cytes (Fig. 4e-h). Cell cycle markers were enriched in V1 subventricular 
zone from E80 to E90 (Fig. 4e), reflecting extended generation of super- 
ficial excitatory neurons in V1 relative to ACG. Likewise, markers for 
excitatory neurons (Fig. 4g) were enriched in ACG prenatally, corre- 
sponding to earlier generation and maturation of these cells. Postnatal 
enrichment in V1 is probably secondary to using markers derived 
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Figure 3 | Variable onset of biological processes between brain regions. 
a, Prolonged duration of biological GO processes. Number of ages at which 
increasing and decreasing GO processes are active in each region (colours) 
or in any region (black). b, Synchronous and asynchronous onset of GO 
processes between regions, mainly beginning in prenatal development. 
Left: age of onset for all increasing GO processes ordered by onset in 
subcortical and then cortical regions (AM, BG, HP, then neocortex 
(NCX)). Right: GO processes predominantly begin earlier in subcortical 
than cortical regions. c, Early onset (arrow) of mature oligodendrocyte 
marker MOBP in subcortical globus pallidus (arrow) compared to cortical 
myelin-enriched regions. Other myelin genes such as OPALIN show 
synchronous developmental timing. CA lor, hippocampal CA1 stratum 
oriens; V1 iz/wm, intermediate zone/white matter; ic, internal capsule. 


from adult mouse V1. Astrocyte marker enrichment in prenatal ACG 
progressed from proliferative to postmitotic layers (Fig. 4h), tracking 
astrocyte generation and migration in these layers. Finally, markers of 
inhibitory GABAergic interneurons showed early enrichment in V1 
subventricular zone (SZ), followed by ACG enrichment in several layers 
(Fig. 4f), a potential signature of early interneuron migration from 
medial ganglionic eminence to caudal cortex. 

The majority of areal genes showed intermediate expression in 
primary somatosensory cortex (S1), located approximately midway 
between rostral ACG and caudal V1 (Extended Data Fig. 6). Therefore, 
rostro-caudal expression gradients, rather than specific features of ACG 
and V1, probably drive many of these areal expression differences. 


Mapping disease genes to development 

Highly heritable but genetically complex neurodevelopmental disorders 
will be more tractable if we can identify common pathways linking many 
candidate risk genes. For example, recent work on autism spectrum 
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disorder (ASD) demonstrated that candidate genes are co-expressed 
in mid-fetal human cortical neurons'”*?4 and converge on synaptic 
development pathways. Large-scale genetic association studies have 
identified high-confidence risk genes associated with primary auto- 
somal recessive microcephaly (MCPH), ASD, intellectual disability, 
and schizophrenia. We localized these disease genes in developing 
cortex using this new high-resolution map. 

We used weighted gene co-expression network analysis (WGCNA) 
to identify sets of genes with correlated patterning (modules) in cortical 
samples at each age, linked modules with overlapping genes between 
adjacent ages, and annotated modules for enrichment in different 
cortical layers and cell types (Fig. 5a and Supplementary Table 9). 
Next, we tested disease gene sets for significant enrichment (hyper- 
geometric test, empirically corrected P < 0.1). As expected, MCPH 
genes showed early- to mid-fetal enrichment in neuronal progenitor- 
enriched modules (Fig. 5b), where they are positioned to alter cell cycle 
kinetics to reduce neuron numbers and brain size. Many ASD genes 
showed coordinated expression in postmitotic neurons prenatally 
(Fig. 5c), as previously reported!73334, but also postnatally. In contrast, 
schizophrenia genes were selectively enriched only in postmitotic 
neuronal modules from infancy through adulthood (Fig. 5d). 

Intellectual-disability-associated genes were not enriched in any gene 
modules, consistent with the view that disruption of many biological 
pathways can undermine cognitive development*™. To investigate 
whether these diverse molecular pathways converge on intermediate 
phenotypes within intellectual disability such as structural brain mal- 
formations, we looked for correlated expression among the 71 intellec- 
tual disability genes across prenatal and postnatal cortex. We identified 
four major expression patterns: (1) postnatal cortical plate, (2) pre- 
natal germinal layers, (3) prenatal subventricular zone and cortical 
plate, and (4) more complex patterns spanning development (Fig. 5e). 
Consistent with the module analysis above, MCPH-associated genes 
had significantly similar expression profiles (multivariate distance 
matrix regression, P < 0.001) and were enriched in prenatal germinal 
zones in clusters 2 and 3 in either the ventricular zone or subventricular 
zone. In contrast, other phenotypes did not share expression patterns, 
including cortical atrophy, seizures and even microcephaly with other 
brain malformations. Furthermore, genes with similar patterns can 
give rise to different phenotypes. ASXL3 and GRIP] are co-expressed 
(R=0.80) in cluster 3, but are associated with MCPH and corpus 
callosum agenesis, respectively. 


Human -specific developmental trajectories 
Humans exhibit unique cortical and cognitive developmental trajec- 
tories compared to other mammals? that are probably correlated with 
differences in gene expression. To identify conserved and divergent 
genes, we compared gene expression trajectories across prenatal and 
postnatal development in frontal cortex between rat, rhesus monkey, 
and two independent human data sets, normalizing timescales by 
developmental event scores!. Expression conservation varied across 
species (Kruskal-Wallis rank sum test, P < 10-7), with greater sim- 
ilarity within humans than within primates (Wilcoxon signed rank 
test, P< 10~°), as expected based on evolutionary distance (Fig. 6a). 
Surprisingly, there was no significant difference in conservation of 
rat trajectories with either human or rhesus monkey, indicating tran- 
scriptional patterning has evolved at the same average rate in human 
and rhesus monkey since our last common ancestor. On average, 69% 
of genes had conserved expression trajectories (R > 0.5) across all 
three species, and genes with the most dynamic expression change 
over development were even more highly conserved (Extended Data 
Fig. 7a, b). For example, EMX2, a transcription factor critical to cortical 
arealization*’, and CNTN1, a cell adhesion gene that guides neuronal 
migration and connectivity (Fig. 6b), are highly conserved. 

A minority (13%) of genes differed between primates and rat (vali- 
dated for a subset of genes in developing mouse; Supplementary Table 10) 
and approximately 9% had human-specific expression trajectories. 
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Figure 4 | Protracted maturation of neocortex through young 
adulthood. a, Expression patterns of genes best marking each cortical 
layer progressively change with time. Each dot represents the average 
across-layer expression correlation of the 10% of genes best distinguishing 
layers at one age (source list) with their expression levels in another age 
(comparison). Values are plotted against a scaled difference in age. 

b, Layer-enriched gene expression shows progressive and extensive change 
with age. Binary heat map showing expression of layer-enriched genes 
(rows) across ages (columns). Genes are ordered and divided based on 

the median age of laminar expression into four chronological groups: 
early (median age < E120), middle (E120-3 months), late (>3 months) 
and persistent. c, Developmental expression of representative genes from 


The proportion of genes showing different conservation patterns was 
robust to gene selection and correlation threshold (Extended Data 
Fig. 7b). Some of these differences were dramatic; for example, BMP3 
decreases over primate development but increases in rodents (Fig. 6b). 
BMP3 is a WNT signalling growth factor that has been linked to cranio- 
facial variation in dogs*®. CNTN2, a close family member of the con- 
served adhesion gene CNTN1, likewise shows opposite developmental 
trajectories in primates and rodents. In both human data sets, LGALS1 
and LIN7A increase or decrease expression, respectively, in opposite 
directions to other species. LGALS1 has a different adult laminar 
pattern of cortical expression in humans compared to mice!!, whereas 
deletion of LIN7A is associated with human intellectual disability and 
disrupted neuronal migration and axonal pathfinding in mouse”. 
Some aspects of cortical development, such as myelination® and 
synaptic pruning’, are protracted or neotenous between cortical areas 
or between species. Reasoning that abrupt changes in expression may 
represent important developmental milestones, we compared develop- 
mental timing by identifying distinct changes in expression trajectories, 
or breakpoints. A total of 179 increasing and 179 decreasing genes met 
these criteria in all three species, being well fit by segmented linear 
regression (Supplementary Table 11). Consistent with different devel- 
opmental rates of these species, breakpoints for genes with increasing 
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each chronological group in layer 6. d, Number of genes differentially 
expressed between rostral (ACG) and caudal (V1) cortex in superficial 
(L2/3), deep (L5/6) and proliferative (SZ, VZ) layers across development. 
Most differences are found in postmitotic layers during early prenatal 

and late postnatal development. e-h, Genes marking specific cell classes 
show regional enrichment consistent with earlier neurogenesis and 
gliogenesis in ACG compared to V1. Left column: enrichment of regional 
genes (d) with markers for cell cycle (progenitors) (e); GABAergic (f) and 
glutamatergic (g) neurons in V1; and astrocytes (h). Size of box indicates 
significance of cell type enrichment in a given layer, region, and age. Right 
column: expression levels (mean + s.e.) of canonical marker genes for each 
cell type in V1 (blue) and ACG (red). 


(Fig. 6c) and decreasing (Extended Data Fig. 7c) expression often 
occurred earliest in rat, at intermediate times for rhesus monkey, and 
latest in human. Almost half (81) of genes with increasing expression 
were synapse related (dashed lines, Fig. 6c), and breakpoints coincided 
with ages of peak synaptic density estimated in these species (shaded 
rectangles, Fig. 6c and Extended Data Fig. 7d). There was no signifi- 
cant difference in the breakpoints for synapse-related genes in human 
V1 versus prefrontal cortex, and only a short delay in rhesus monkey 
V1 (42 days, P=9.6 x 10~°, Extended Data Fig. 7e), consistent with 
synchronous synaptogenesis reported in primates”***°. A later study 
reported protracted synaptogenesis in human", but this study suffered 
from sparse sampling and no statistical analysis and does not appear 
consistent with the bulk of evidence. 

Although the timing of gene expression breakpoints (a change in the 
slope of a developmental trajectory) was largely conserved between 
species, particularly between rhesus monkey and rat (Extended 
Data Fig. 7f), human breakpoints were clearly bimodally distributed 
(Fig. 6c), suggesting that some genes had different developmental 
timing. For example, OLIG1 is expressed in maturing oligodendrocytes 
and had a late breakpoint relative to rhesus monkey (Fig. 6d), consistent 
with prolonged myelination in several human cortical areas compared 
to rhesus monkey and chimpanzee®. Surprisingly, many more genes 
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Figure 5 | Spatiotemporal localization of disease-specific associations 
in developing cortex. a, WGCNA gene co-expression networks (modules; 
Supplementary Table 9) based on pooled V1 and ACG samples (n = 603) 
analysed independently at each age (columns). Modules were tested for 
significant gene set enrichment and overlap with hypergeometric tests. 
Modules significantly enriched for markers of glial cell classes (P < 107°) 
or cortical layers (P < 10°) are colour-coded and annotated. Remaining 
modules are coloured or labelled based on maximal expression in 
postmitotic (neuron-enriched; cyan) layers or progenitor or largely non- 
neuronal (WM, layer 1; orange) layers. Modules from adjacent ages with 
the most highly significant gene overlap (P < 10 -*°) are connected by grey 
lines. b-d, Left: modules significantly enriched for risk genes associated 
with neurodevelopmental disorders (empirically corrected P < 0.1; red 
discs). Right: average expression pattern of genes found in at least two 
enriched modules. b, Genes related to primary autosomal recessive 


had an early rather than late breakpoint in human prefrontal cortex 
(Fig. 6e), in contrast to previous work! showing human-specific 
delayed peak expression in this brain region. However, we found that 
many early genes continued to increase expression through adulthood 
(Fig. 6e), such as SYT7 (Fig. 6d), a presynaptic calcium-binding protein 
important for modifying neurotransmitter release”. In rhesus monkey, 
gene expression often did not change significantly after breakpoints. 
Genes with an early breakpoint and late maximal expression may 
mark developmental processes that are protracted in human relative 
to non-human primates and that underlie our extended cognitive 
development. Interestingly, we found this transcriptional signature of 
neoteny between species in both prefrontal cortex and V1, but much 
less so in evolutionarily older structures including hippocampus, amyg- 
dala and striatum (Fig. 6e and Extended Data Fig. 7g). 


microcephaly are enriched in early non-neuronal modules, and show 
maximal expression in prenatal ventricular zone. c, Genes related 

to autism spectrum disorder are enriched in modules associated with 
cortical neurons, and show highest expression in cortical plate across 
development. d, Genes related to schizophrenia show similar neuronal 
layer enrichment to autism genes, but restricted to postnatal ages. 

e, Left: genes related to intellectual disability (ID) are associated with 
different brain malformations and/or seizures and show four major 
expression patterns over cortical development. Right: expression heat maps 
of known and candidate disease genes representing developmental profiles 
of each cluster. Intellectual disability genes also associated with primary 
microcephaly are significantly enriched in cortical progenitor-enriched 
pattern 2 (*P < 0.001; multivariate distance matrix regression permutation 
test). CC, corpus callosum. 


Discussion 
The current project transcriptionally profiled primate brain devel- 
opment with fine anatomical detail from early gestation to young 
adulthood. Although expression rates of change decreased more than 
100-fold over development, we observed small but coherent changes 
from juvenile to adult, a period of enormous cognitive change and sus- 
ceptibility to neuropsychiatric disease. These changes related to the late 
emergence of mature laminar and areal signatures, biological pathways 
such as negative regulation of axonal pathfinding, and the appearance 
of gene modules significantly associated with ASD and schizophrenia. 
Two unexpected characteristics of expression trajectories support 
extrinsic influences on developmental transcriptional regulation. First, 
there was a striking synchrony among genes changing in disparate 
brain regions and cell populations, suggesting a mechanism for global 


21 JULY 2016 | VOL 535 | NATURE | 373 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


Developmental expression similarity between species 


rf NS Rhesus vs Rhesus vs 
Ss 1.04 human rat 
3 
g e 
8 0.5 + 5 
= 7 
£ 0 x 
iy —y 
® 
= Ny 2 
= 2 
5 -05 4 
3 3 
8 g 
x _1.0- 8 
W107 : 3 
VE SF 
oY” SP EF oF 5 
FSS 2 a 
eo @ = 
RS = = & Human vs rat 
se & & 
we = 
c d 


X@UOO |e}Uds4 
Expression (z-score) 


10 100 1,000 10,000 


> 
ca 0 Age (days post-conception) 
B4 
a Late breakpoint in human 
3 s & 
5 8 4.0 ‘ ' ' 
g 8 
2 co} 
g 5 
@ 
1] \ = 3 
Jr? i 
x 
} w 
0 


10 100 1,000 10 
Age (days post-conception) 


100 1,000 10,000 
Age (days post-conception) 


Species Gene set 
Oo Human All 
Rat = 
oO Macaque Oo 1 _, Synapse 


Figure 6 | Conserved and human-specific gene expression trajectories 
in frontal cortex. a, Left: box plots of pairwise correlations between 
developmental expression trajectories (median + 25th and 75th 
percentiles, whiskers at 1.5 times the interquartile range) of orthologous 
genes profiled in rhesus monkey, rat and two human data sets. Kruskal- 
Wallis rank sum test, post hoc Wilcoxon signed rank paired tests: 

*P < 0.001 (Bonferroni-corrected), NS, not significant. Right: Venn 
diagram showing the number of conserved (R > 0.5) genes between 

each pair of species. b, Examples of conserved and species-specific gene 
trajectories. Colours and symbols are consistent in b-d. c, Distribution of 
breakpoint ages for 179 orthologues (solid lines) of which 81 were synapse 
related (dashed lines) in frontal cortex and V1. Shaded bars indicate 
periods of peak synaptic density (95% confidence intervals) in each species 
(Extended Data Fig. 7b). d, Genes with unexpectedly early (SYT7) 


regulation such as circulating hormones. Second, the surprisingly late 
acquisition of adult-like cortical areal and laminar molecular pheno- 
types points to an important role for contextual and activity-dependent 
mechanisms in sculpting mature cellular phenotypes*>. 

We show that 22% of genes have different developmental trajecto- 
ries in rat and human, comparable to 25% of genes that have different 
laminar patterns in adult mouse and human cortex!!. In contrast, 
approximately 9% of genes have different trajectories in rhesus mon- 
key and human, including genes with delayed peak expression solely in 
human cortex. Therefore, rhesus monkey is an important comparator 
for understanding human-specific features of brain development, but 
cannot fully model protracted circuit formation and associated diseases 
that are seen only in humans. 

A recently expanded set of ASD candidate genes** were significantly 
enriched in newborn neurons during prenatal development, as previ- 
ously reported!”*334, but also enriched throughout postnatal develop- 
ment. Schizophrenia risk genes were also enriched in neurons but not 
until infancy, suggesting a larger role for dysfunction in circuit refine- 
ment than prenatal processes such as neurogenesis. Schizophrenia and 
ASD were enriched in the same neuron-enriched module in infancy 
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and late (OLIG1) breakpoints in expression trajectories in human. Observed 
breakpoints (dashed lines) compared to expected breakpoints (dotted 
lines) based on timing in rhesus monkey. e, Many genes in human cortex 
have early breakpoints followed by prolonged increase in expression. Left: 
comparison of breakpoint timing (points with 95% confidence intervals) 
between human and rhesus shows a biased population of early breakpoint 
genes in human cortex but not striatum. Many of these early breakpoint 
genes continue to increase expression through young adulthood (red) 

in contrast to genes with relatively conserved timing that tend to plateau 
(grey) or decrease (blue). Ages corresponding to equivalent (solid lines) 
or nearby (dashed lines) developmental stages (event scores within + 0.2; 
see Methods) in human and rhesus monkey are shown. Right: bar plot 
summarizing the number of genes that have an early or late breakpoint in 
human in different brain regions. 


despite only 5% overlap in candidate genes, pointing to a shared aeti- 
ology that is being increasingly appreciated*®. 

This data resource (http://www.blueprintnhpatlas.org and http:// 
www.brain-map.org) has many potential applications. For example, 
it could be used to establish an in vivo baseline against which to com- 
pare the identity and maturity of in vitro stem-cell-derived neurons”. 
Recent technical advances enable profiling the full transcriptome!?!, 
epigenome’’, and ultimately proteome of single cells. These techniques 
promise to refine this broad survey to a causal understanding of molec- 
ular programs driving the complete lineage of primate brain cells and 
the maturation of specific neuron types in functional circuits. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size, and the exper- 
iments were not randomized or blinded. More complete descriptions of the 
experimental and data processing methods are available in protocol documents 
at the NIH Blueprint NHP Atlas website (http://www.blueprintnhpatlas.org; 
documentation tab). 

Animals. Frozen post-mortem tissue samples from prenatal rhesus macaque 
(Macaca mulatta) of predominantly Indian origin (Extended Data Table 1) were 
provided by the Time-Mated Breeding Program at the California National Research 
Primate Center (CNPRC; http://www.cnprc.ucdavis.edu). Prenatal brain material 
was acquired following fetectomy using standard operating procedures (SOPs) at 
the CNPRC. Timed pregnancy derived biological replicate specimens (2 males, 
2 females) at each of six prenatal developmental stages (E40, E50, E70, E80, E90, 
and E120) were profiled. These time points were selected to coincide with peak 
periods of neurogenesis for the different layers of primary visual cortex based 
on birthdating experiments”*. Frozen postmortem tissue samples from postnatal 
rhesus macaque were also provided by CNPRC. Brain regions were systematically 
collected from well-characterized rhesus monkeys born and raised at the CNPRC 
in outdoor, half-acre enclosures that provide a naturalistic setting and normal 
social environment. For transcriptome analysis, three male specimens at each of 
four postnatal developmental stages representing the neonate (0 months), infant 
(3 months), juvenile (12 months) and post-pubertal adult (48 months) were 
profiled. Extensive health, family lineage and dominance information were main- 
tained on all animals. All procedures were approved by the Institutional Animal 
Care and Use Committee (IACUC) at UC Davis. 

Postnatal MRI and histological reference series. An interleaved anatomical atlas 
of magnetic resonance imaging (MRI), histology, and block face photographs was 
created from four male rhesus macaques at 2 weeks, 3, 12 and 48 months after birth 
as described in the ‘reference series’ protocol document available at (http://www. 
blueprintnhpatlas.org). In addition, ISH data were generated serially across com- 
plete hemispheres of three post-pubertal adult (48 months) male specimens. Genes 
were selected to mark specific cell populations and cortical areas and to represent 
gene families important to neural function and associated with neuropsychiatric 
disease and brain evolution. Several histological stains (Nissl, acetylcholinesterase 
and SMI-32) of complete hemispheres were combined with ISH data to provide 
an unprecedented cellular-resolution view of gene expression in the whole post- 
pubertal adult brain. High-throughput colorimetric ISH methods are described 
in detail elsewhere” and in the ‘in situ hybridization’ protocol document available 
at (http://www.blueprintnhpatlas.org). 

Tissue processing. Following collection, brain tissues were partitioned in a man- 
ner dependent on specimen stage, gradually frozen, and then stored at —80°C 
until processing. For a subset of E40 specimens, following specimen collection, the 
calvarium was frozen intact in an isopentane/dry ice slurry maintained at —40°C 
to —45°C, gradual freezing at a steady rate. For the majority of E40 specimens, the 
specimens were embedded in OCT (optimal cutting temperature compound) dur- 
ing the freezing process. In brief, chilled OCT was placed around the calvarium. A 
disposable embedding chamber was filled with approximately 5 mm} chilled OCT. 
The specimen was carefully oriented and centred in the OCT, posterior surface 
down in the OCT. Then, the specimen was aligned along the medial/lateral axis 
using the bilateral ocular fiduciaries as a frame of reference. Next, the specimen 
was aligned in the coronal plane. After alignment along all axes, OCT was added 
to encase the specimen in its entirety. The top of the specimen was covered with 
approximately 3mm of OCT. The chamber containing the specimen was directly 
placed onto a level bed of dry ice. The specimen and OCT were allowed to freeze 
completely. After demarcation of the orientation of the brain in the OCT block, 
the frozen tissue block was stored at —80°C. 

For the E50 specimens, following removal of the brain from the skull, the 
whole brain was frozen intact in an isopentane/dry ice slurry maintained at 
—40°C to —45°C, gradual freezing at a steady rate. For a subset of E50 speci- 
mens, the specimens were embedded in OCT during the freezing process. In 
brief, chilled OCT was placed around the intact brain. Freestanding aluminium 
bars were assembled onto a Teflon coated plate and sized to the appropriate spec- 
ifications for the E50 brain. The internal chamber was filled with approximately 
5 mm} chilled OCT. The specimen was carefully oriented and centred in the 
OCT dorsal surface down in the OCT. Then, the specimen was aligned along the 
medial/lateral axis using the longitudinal fissure as the frame of reference. Next, 
the specimen was aligned in the coronal plane. After alignment along all axes, 
OCT was added to encase the specimen in its entirety. The top of the specimen 
was covered with approximately 3 mm of OCT. The Teflon plate containing the 
specimen was directly placed onto the level bed of dry ice. The specimen and 
OCT were allowed to freeze completely. After demarcation of the orientation of 
the brain in the OCT block, the aluminium bars were removed and the frozen 
tissue block was stored at —80°C. 


For the E70, E80, E90, and E120 specimens, the hemispheres were bisected along 
the midline and individually frozen by placing the medial aspect of each hemi- 
sphere down onto an aluminium-Teflon coated plate that was slowly lowered into 
an isopentane/dry ice slurry maintained at —40°C to —45°C. Only approximately 
a third of the tissue was submerged in the slurry to allow the tissue to gradually 
freeze and to keep freezing artefacts to a minimum. Frozen hemispheres were 
stored at —80°C. 

Depending on the prenatal time point, different approaches were taken for 
generating coronal slabs. When possible, the number of slabs per specimen was 
kept to a minimum. The E40, E50, and E70 specimens were not slabbed. For E80, 
the first slab contained up through the temporal pole and the second slab contained 
the occipital pole. For E90, the first slab contained the frontal lobe anterior of the 
temporal pole and the second slab contained temporal pole posterior through the 
occipital lobe. For E120, three coronal slabs were made. The first slab consisted 
of the frontal lobe anterior of the temporal pole. The second slab consisted of the 
temporal pole posterior to the cerebellum including all of the mid-brain. The third 
slab included primarily the occipital lobe. 

For postnatal brains, after dissection brains were sectioned into coronal slabs 
approximately 1 to 1.5 cm in thickness and the left hemisphere was prepared for 
sectioning onto microscope slides for ISH. Structures for microarray analysis were 
isolated from the right hemisphere slabs, and these samples were then frozen at 
—80°C until processed further. 

Laser microdissection and RNA isolation. Tissue spanning five anatomically 
distinct brain regions—frontal cortex, visual cortex, hippocampus, striatum, and 
amygdala (Fig. 1 and Supplementary Table 1)—was selected from each specimen 
and processed for further thin sectioning and LMD using a standard protocol. 
Specifically, frozen tissue was cryosectioned at 141m onto polyethylene naphtha- 
late (PEN) slides (Leica Microsystems, Bannockburn, IL) and a 1:10 Nissl series was 
generated for neuroanatomical reference for all prenatal time points. In addition, 
for the E40, E50, E70, E80, and E90 time points, a 1:10 GAP43 and 1:10 ENC1 
in situ hybridization (ISH) series was generated for neuroanatomical reference, 
as they often clearly delineate nuclei and layers at early developmental stages. For 
E120, a 1:10 acetylcholinesterase series was generated for neuroanatomical refer- 
ence. In total, 127 transient and terminal anatomical structures were isolated using 
this strategy°?*?. After drying for 30 min at room temperature, PEN slides were 
frozen at —80°C. Slides were lightly Nissl stained with cresyl violet to allow cytoar- 
chitectural visualization. Slides were fixed in ice-cold 70% ethanol for 30s, washed 
for 15s in nuclease-free water, stained in 0.7% cresyl violet in 0.05% NaOAc, 
pH 3.4 for 2 min, nuclease-free water for 15s, followed by 15s each in 50%, 75%, 
and 95% ethanol, followed by 20s in 100% ethanol, and then a final 100% ethanol 
wash for 25 s. Slides were air-dried for 2 min, and desiccated in a vacuum for 1h, 
then frozen at —80°C until microdissection. Laser microdissection was performed 
ona Leica LMD6000 (Leica Microsystems, Bannockburn, IL), using the Nissl stain 
and GAP43 and ENC1 ISH or acetylcholinesterase histological staining as a guide 
to identify target brain regions in prenatal samples. Microdissected tissue was 
collected directly into RLT buffer from the RNeasy Micro kit (Qiagen, Valencia, 
CA) with 8-mercaptoethanol. Samples were volume-adjusted with water to 751, 
vortexed, centrifuged, and frozen at —80°C. RNA was isolated for each structure 
following the manufacturer’s directions for the RNeasy Micro kit. RNA samples 
were eluted in 14,1] and 11] was run on the Agilent 2100 Bioanalyzer (Agilent 
Technologies, Santa Clara, CA) using the Pico assay. Due to low sample volume 
and incompatibility of the eluant with the Nanodrop spectrophotometer (Thermo 
Scientific, Wilmington, DE), samples were quantitated using the Bioanalyzer con- 
centration output. This was done by running a 1 ng! RNA standard on the same 
Pico chip and then dividing the sample concentration output by the output of the 
standard concentration. For prenatal samples, 2 ng of total RNA was almost always 
used as the input amount for the labelling reaction, and the average RNA integrity 
number (RIN) of passed samples was 7.5, with RINs typically lower than 4.5 failing. 
For postnatal samples, 5 ng of total RNA was almost always used as the input, and 
the average RIN of passed samples was 7.5, with RINs typically lower than 5 failing. 
Histological stains. Niss/. For the Nissl neuroanatomical reference slides, slides 
were stored at 37 °C for 1-5 days before staining. Sections were defatted with xylene 
or the xylene substitute Formula 83, and hydrated through a graded series con- 
taining 100%, 95%, 70%, and 50% ethanol. After incubation in water, the sections 
were stained with 0.213% thionin, then differentiated and dehydrated in water and 
a graded series containing 50%, 70%, 95% and 100% ethanol. Finally, slides were 
incubated in xylene or Formula 83, and coverslipped with the mounting agent 
DPX. After drying, slides were cleaned before digital imaging. 

Acetylcholinesterase. A modified acetylcholinesterase protocol was used to assist 
in neuroanatomical delineations in E120. Unlike acetylcholinesterase staining in 
fixed tissue, staining in fresh frozen tissue does not elucidate cholinergic fibres 
commonly seen in the cortex, but instead provides demarcation of various subcor- 
tical nuclei. Acetylcholinesterase staining was performed using a direct colouring 
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thiocholine method combined with a methyl green nuclear counterstain to improve 
tissue visibility™. Glass slides with fresh frozen tissue sections were removed from 
4°C, allowed to equilibrate to room temperature, fixed in 10% neutral buffered for- 
malin (NBF) and washed briefly in ultra-pure water. Sections were then incubated 
for 25 min in a solution of acetylthiocholine iodide, sodium citrate, cupric sulphate, 
and potassium ferricyanide in a 0.1 M sodium acetate buffer (pH 6.0), washed in 
0.1 M Tris-HCl buffer (pH 7.2), and incubated with 0.5% diaminobenzidine (DAB) 
in 0.1 M Tris-HCl with 0. 03% hydrogen peroxide for 8 min. Slides were incubated 
in 0.2% methyl green, briefly dipped in 100% ethanol, cleared with Formula 83 and 
coverslipped with DPX. After drying, slides were cleaned before digital imaging. 
In situ hybridization (ISH). High-throughput colorimetric ISH methods are 
described in detail elsewhere (see ref. 49) and in the in situ hybridization protocol 
document located under the documentation tab at (http://www.blueprintnhpatlas. 
org). In brief, fresh frozen tissue sections (from either E40, E50, E70, E80, or E90) 
on slides were fixed in 4% PFA in PBS, acetylated, and dehydrated through graded 
alcohols. Endogenous peroxidase activity was blocked with 3% hydrogen peroxide 
in methanol. Proteinase K digestion at 0.0135 U ml was carried out. Digoxigenin 
(DIG)-labelled riboprobes (either GAP43 or ENC1) were hybridized at 63.5°C for 
5.5h, followed by stringency washes and a series of enzymatic reaction steps for 
detection and amplification of DIG signal. Sections were washed with EDTA, fixed 
in 4% PFA, and washed with acid alcohol (70% ethanol, adjusted to pH = 2.1 with 
12 N HCl) to reduce background signal. Slides were coverslipped with Hydromatrix, 
subjected to quality control analysis and cleaned before digital imaging. The com- 
plete list of 46 ISH gene targets for ISH in postnatal rhesus monkey is also available 
in the documentation tab at (http://www.blueprintnhpatlas.org). 

Digital imaging and processing of histologically stained sections. Digital imag- 
ing was done using the ScanScope XT (Aperio Technologies, Vista, CA). The final 
resolution of images was 1 |1m per pixel. All images were databased and preproc- 
essed, then subjected to quality control analysis to ensure optimal focus and that no 
process artefacts were present in the images. Images that passed this initial quality 
control were further assessed to ensure that the staining data were as expected. 
Once all quality control criteria were met, images became available for annotation 
of anatomic structures. 

mRNA profiling. Prenatal samples passing RNA quality control were amplified and 
profiled. Briefly, samples were amplified and labelled using a custom 2 cycle Ultra 
Low Input procedure, using components of MessageAmp II aRNA Amplification 
kit (AM1751) for the first amplification cycle (using oligodT), and components of 
MessageAmp II Biotin Enhanced Single Round aRNA Amplification kit (AM1791) 
for the second amplification cycle (using both random hexamers and oligodT). 
For prenatal samples, 2 ng of total RNA was added to the initial reaction mix 
together with 250 ng of pBR322 (Invitrogen). Following the first cycle of IVT, 
the plasmid carrier was removed with a DNasel (Qiagen) treatment. The first 
cycle IVT products were purified using the QiagenMinElute Kit (Qiagen). Input 
into the second cycle was normalized to 400 ng. Hybridization was to catalogue 
GeneChip Rhesus Macaque Genome Arrays from Affymetrix containing 52,803 
probe sets/sequences. For detailed information about this macaque microarray, 
see the Affymetrix website (http://www.affymetrix.com/browse/products.jsp?pro 
ductId=131496&navMode=34000&navAction=jump &ald=productsNav#1_3). 
Labelling and scanning were completed following the manufacturer’s recommen- 
dations. Quality control assessment failed a small number of microarray samples. 
Sample amplification, labelling, and microarray processing were performed by 
Covance in Seattle, WA. 

Postnatal samples passing RNA quality control were amplified and profiled 
as described in Winrow et al.°° with a few modifications. Briefly, samples were 
amplified and labelled using a custom 2 cycle version, using 2 kits of the GeneChip 
HT One-Cycle cDNA Synthesis Kit from Affymetrix. For postnatal samples, 5 ng 
of total RNA was added to the initial reaction mix together with 250 ng of pBR322 
(Invitrogen). In vitro transcription (IVT) for the first cycle was performed using 
a 5X MEGAscript T7 Kit (Ambion). Following the first round of IVT, the plas- 
mid carrier was removed with a DNasel (Qiagen) treatment. The first round IVT 
products were purified using the QiagenMinElute Kit (Qiagen). Input into the 
second round was normalized to 150 ng for postnatal samples. Hybridization was 
to catalogue GeneChip Rhesus Macaque Genome Arrays. Labelling and scanning 
were completed following the manufacturer’s, recommendations and profiles were 
normalized using robust multi-array (RMA). Sample amplification, labelling, and 
microarray processing were performed by Covance in Seattle, WA. 

Microarray processing and normalization. To account for differences in mRNA 
processing, prenatal and postnatal samples were first normalized separately using 
a standard procedure, and then scaled together to minimize the effects of techni- 
cal biases (for example, due to differences in mRNA processing of prenatal and 
postnatal samples) while preserving biological variance. For prenatal samples, the 
BioConductor “affy” package was used to read in the Affymetrix microarray data 
and RMA method”, consisting of background normalization, log, transformation, 
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and quantile normalization, was applied to summarize the probe level data into 
gene expression measurements for each batch. Control samples from macrodis- 
sected adult cortex were included in each batch to normalize the data across batches 
by aligning their mean gene expression values. Postnatal samples from each batch 
were preprocessed using the same RMA method as for prenatal samples. ComBat 
(http://www.bu.edu/jlab/wp-assets/ComBat/)*” was then applied to reduce more 
severe non-biological batch-to-batch bias. The postnatal microarray data are 
unchanged since the original public release. Finally, the normalized prenatal data 
was scaled to postnatal data by sorting probes by average expression across devel- 
oped regions in E80 and older animals, finding the difference in average expression 
levels (postnatal - prenatal) of each group of 100 probes, and then scaling each 
probe by the corresponding value. 

The purpose of normalization was to minimize the effects of these non- 
biological biases while keeping biological variance intact so that within and across 
brain comparisons primarily reveal differences and similarities that are biologi- 
cally relevant. As a data-driven quality control process, for each batch, we applied 
clustering/MDS to detect any outlier in the batch by checking whether samples 
from the same structure/age were grouped together. IAC (inter-array connectivity) 
was also calculated to numerically measure how one microarray was similar to the 
other microarrays in the batch. The same quality control process was applied over 
multiple batches to identify outliers. After normalization and outlier removal, a 
total of 1,212 prenatal and 724 postnatal samples remained. 

Probe set filtering and gene symbol assignments. From the 52,865 probe sets 
included on the microarray, we identified 12,441 high-confidence probe sets 
mapping to unique genes (Supplementary Table 2). First, to assess the targeting 
specificity of each probe set, we pooled mRNA from 20 prenatal and 20 postnatal 
discrete anatomical and temporal populations and ran these mRNA pools both on 
Affymetrix microarrays and using RNA-seq as previously described**. Since a high 
quality transcriptome was not available for rhesus macaque, we used GSNAP to map 
RNA-seq reads to the rheMac2 (January 2006) genome and identified expression 
values for 50,648 putative transcripts. We used Galaxy (http://usegalaxy.org)°?-°! 
to intersect rheMac2 genomic coordinates of probes (Affymetrix annotation ver- 
sion 34; 30 January 2014), GSNAP transcripts and NCBI rhesus gene information 
(downloaded 6 February 2014). 34,646 probe sets mapped unambiguously to 
15,577 genes (rhesus Entrez IDs) and were further considered. For each probe set, 
we calculated the correlation between probe set intensity and FPKM values for all 
RNA-seq transcripts corresponding to the same gene and retained the most highly 
correlated probe set for each gene. We omitted 3,136 genes that did not have any 
probe sets significantly correlated (FDR <0.1) between methods. We annotated 
the final set of 12,441 genes with 10,715 human and 10,403 mouse orthologues 
using NCBI Homologene (downloaded 7 January 2014). 

Comparing gene expression across neocortex. Heat maps were independently 
generated for each gene by mapping expression levels in neocortex to a colour 
vector of white (low expression) to red (high expression). Samples were organized 
in a grid corresponding to layer (vertical axis, from ventricular to pial surface), 
age (horizontal axis, from young to old), and cortical region (ordered rostral to 
caudal within each age). Each sample is labelled on the axes as well as within the 
corresponding boxes, with missing data points (either due to incomplete sampling 
or if a structure does not exist) left blank. The size of each box is chosen for clarity 
and is unrelated to gene expression. 

We calculated the correlation between all neocortical samples based on the 
expression correlation of 12,441 genes. We converted correlations into distances with 
the transformation 1 — (0.5 x 1 + correlation), applied metric multi-dimensional 
scaling using the cmdscale R function to represent these distances in two dimen- 
sions, and estimated per cent variance explained for the first two principal coordi- 
nates. We plotted samples with point colour representing cortical layer and point 
size representing age. For clarity, samples that were annotated as cortical plate were 
assigned to postmitotic layers (L2-L6) that were most transcriptionally similar. We 
repeated this analysis for two subsets of samples: prenatal cortical layers and V1 
postmitotic layers (layers 2-6). 

Sex differential expression. Brain samples with data from both male and female 
donors included all prenatal ages and were included in a linear mixed model with 
sex as a fixed effect and brain region, age and donor as random effects. One model 
was fit for each gene individually using the “nlme” R package and P values were 
estimated using Satterthwaite’s approximation to degrees of freedom as imple- 
mented in the “ImerTest” R package (see Supplementary Table 3). A permuta- 
tion based correction for multiple comparisons was then performed. For each 
of 6 prenatal ages, donor sex labels (2 males, 2 females) were shuffled to generate 
64 permuted data sets. For each of these permutations, an identical linear mixed 
model was fit to all genes, and genes were ordered by the magnitude of the fixed 
effect of sex. Median and 95% confidence interval (2.5% and 97.5% quantiles) 
were calculated for each rank ordered fixed effect. Finally, a quantile-quantile plot 
(Extended Data Fig. 3) was created to compare observed fixed effects to median 
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expected (permuted) effects. 11 genes had sex effects lying outside of the 95% con- 
fidence interval (greater expression in males), and 7 of these genes were nominally 
significant (mixed model P value <0.05). 

For probes that showed differential sex expression greater than expected by 
chance or for Y chromosome probes, target sequences were downloaded from 
Affymetrix (https://www.affymetrix.com/analysis/index.affx). These sequences 
were compared to a primate nucleotide database using NCBI Blastn®, and the 
best match (query coverage >80%, sequence identity >90%) was reported in 
Supplementary Table 3. Note that 5 of 6 Y chromosome probes probably measured 
expression of highly similar paralogues on the X chromosome or autosomes, a 
consequence of the Affymetrix rhesus macaque microarray having been developed 
several years before the rhesus monkey Y chromosome was sequenced”. 
Analysis of dynamic genes using GO. We first calculated average gene expres- 
sion in all brain regions (n = 3-4 monkeys) at 10 ages, and then calculated rates 
of expression change as the difference in logo-transformed expression between 
adjacent ages in each region and expressed as doublings or halvings per year. Note 
that rates of expression change could not be calculated for the earliest time point 
(E40) or for regions without sampling at adjacent ages. For each of 419 distinct 
brain regions, we rank ordered genes by rates of expression change, and selected 
the top 1,000 increasing and decreasing genes. We compared all of these genes lists 
(419 x 418 = 175,142 comparisons). For each gene list comparison, we found the 
number of overlapping genes and calculated an overlap similarity score based on 
the weighted rank order of the lists ((OrderedList’ package in R). We then generated 
a null distribution of similarity scores by shuffling gene lists 10 million times and 
calculated P values for observed similarity scores. We found that 56,540 gene lists 
were significantly (Bonferroni corrected P < 0.05) overlapping. 

We visualized the number of overlapping genes as a heat map and reordered 
rows and columns based on the correlated patterns of overlapping increasing 
genes. Specifically, we calculated the distance between 419 brain samples as 
(1 - correlation) / 2 and used unsupervised hierarchical clustering with complete 
linkage to order samples. The dendrogram was reordered (while preserving the 
tree structure) by weighting branches by median ages of samples and rotating 
branches to group similar ages. Samples from the same age were highlighted with 
grid lines aligned to this dendrogram (Fig. 2b). The heat map was also annotated 
with brain structure and a mitotic index approximated by mean expression of 
MKI67 (Ki-67 protein). 

We performed GO enrichment with the Bioconductor ‘GOstats’ package in R 
with a background set of 10,715 human orthologues included in this study. We 
repeated GO enrichment analysis on each of 56,540 gene lists (32,053 increasing 
and 24,487 decreasing), and found 11,899 biological process terms were nominally 
significant (P < 0.05) for at least one list. To reduce false positive associations, we 
calculated the level of statistical support for each GO term as the proportion of 
children of that term (based on the GO hierarchy) that were nominally enriched. 
For example, ifa GO term had 10 associated children and four were enriched, then 
the term had a support of 0.4. For each term, we plotted a heat map of statistical 
support for all 56,540 gene lists (on a scale of 0 to 1), and many of these heat maps 
showed distinct patterns of enrichment (for example, Fig. 2c and Extended Data 
Fig. 4). We summarized GO enrichment as a table by calculating the average 
support for all 419 regions and considered a GO term active if mean support was 
greater than 0.01. We transformed this table into a binary matrix showing when 
and where GO terms were active. We removed terms associated with fewer than 
10 of 419 regions, as these terms probably represented noise, and this resulted in 
3,704 processes increasing and/or decreasing over development (Supplementary 
Table 4). 

We repeated the previous gene overlap and GO enrichment analyses with a 

set of five major brain regions including neocortex, hippocampus, basal ganglia 
and amygdala at nine ages and thalamus at four ages (40 total brain regions). We 
found that 556 of 1,600 gene list comparisons were significantly overlapping and 
1,012 biological processes were active in these regions (Supplementary Table 5). 
For each GO term, we calculated commonality as the number of ages (maximum 
nine) that term was active in each brain region and in any region. We plotted the 
distribution of commonalities across all terms in different regions as density line 
plots separately for increasing and decreasing processes (Fig. 2d). 
Relative timing of developmental processes. We calculated the relative timing 
of GO term activity using the binary matrix representing when and where terms 
were active in the four brain regions (neocortex, hippocampus, basal ganglia and 
amygdala) for which we had information across development. For each of six pos- 
sible region comparisons, we divided the number of ages a GO term was active 
in both regions by the number of ages it was active in either region (for example, 
Extended Data Fig. 5a). For each term, we averaged the six age overlap calculations 
to summarize overall timing differences. We repeated these calculations for all 
1,012 GO terms and plotted the distribution of age overlaps for increasing and 
decreasing processes (Extended Data Fig. 5b). 


For 837 increasing processes, we plotted the earliest age of activity in four brain 
regions and ordered terms by age of onset in amygdala, basal ganglia, hippocampus 
and neocortex (Fig. 3c). We characterized timing as either simultaneous (equal age 
of onset in all regions), subcortex early (either amygdala or hippocampus onset 
earlier than both hippocampus and neocortex), or cortex early (either hippocam- 
pus or neocortex onset earlier than both amygdala and hippocampus). Next, for 
each term, we rank ordered the four brain regions based on age of onset. Ties 
were broken by assigning the average rank to all tied regions. For example, if a 
process was first active at E50 in amygdala and at E70 in the other three regions, 
then amygdala was assigned a rank of 1 while the other regions were assigned a 
rank of 3 (average of ranks 2, 3 and 4). This tie-breaking method highlighted terms 
that were earlier and later than all other regions. We calculated region ranks for 
837 terms and plotted a heat map of the proportion of terms with each rank for each 
region (columns sum to one in Extended Data Fig. 5c) and used these proportions 
to calculate the weighted mean rank of each region. 

We selected the top 40 genes enriched in myelinating oligodendrocytes in adult 

mouse® and identified 23 orthologues with expression data in rhesus. Six genes 
(MOG, MOBP, ERMN, MAL, ASPA and OPALIN) were highly correlated (R > 0.9) 
with each other across all brain regions over development (Extended Data Fig. 5d). 
We calculated average expression of these genes (n= 3-4 subjects) in the highly 
myelinated regions that were sampled in this study. 
Transcriptional signatures of V1 layers. We compared gene expression across 
postmitotic layers in V1 for each age starting from E80, the earliest age at which all 
layers were sampled. Samples from L2 and L3 were considered together as L2/3, and 
samples from sub-layers of L4 were considered together as L4. For each subject and 
age, we calculated the average log, scaled gene expression of layers 2/3, 4, 5 and 6, 
as well as adjacent layers marginal zone/L1 and intermediate zone/white matter 
for comparison. For each age, we fit a linear model to the expression data with a 
dummy variable coding for layer. The proportion of variance explained by each 
layer was estimated using the CAR metric in the ‘relaimpo’ R package. For each 
layer and age, we rank ordered genes based on the proportion of variance explained 
(Supplementary Table 6). 

Layer signatures between ages were compared by identifying the top 10% most 
laminar genes evenly allocated between layers (top 2.5% each in L2/3, L4, L5, 
and L6), and then calculating their average across-layer correlation between ages. 
Specifically, for each ‘source’ list of marker genes, we found the average expression 
by layer of these genes at the ‘source’ age and at every other ‘target’ age, resulting in 
two expression tables of 1,224 genes by 4 layers. Gene by gene, expression across 
layers was compared using Pearson correlation and the average correlation across 
all 1,224 genes was determined. 

Differential expression between the layer and all other layers was calculated 
using the ‘limma’ R package. Genes were considered layer-enriched if they 
explained at least 25% of the expression variance in all layers (see description of 
analysis above) and had expression >1.5-fold higher than other layers combined. 
Genes marking the same layer in at least two ages were further defined as persistent 
if they were laminar during at least 5 out of 7 ages, or early, middle, or late if their 
median age of laminar enrichment was <E120, between E120 and 3 months, or 
>3 months, respectively (Supplementary Table 7). Selected marker genes were 
visualized using heat maps as described above, but only showing the subset of data 
considered in this analysis (L1-WM in V1 from E80 to 48 months). To characterize 
the function of genes from each laminar timing group, we grouped genes from all 
layers with the same timing and performed gene ontology (GO) enrichment using 
all “Biological Process” terms with fewer than 500 associated genes. We identified 
significantly associated terms (nominal P < 10~“ and five or more genes enriched 
for at least one timing group) and visualized —logio P values using a heat map. 
Differential expression between neocortical regions. We compared gene expres- 
sion in V1 and ACG independently for each age in the outer (L2/L3) and inner 
(L5/L6) cortical layers, in the subventricular zone (SZ), and in the ventricular zone 
(VZ). First, we averaged gene expression (logarithmically scaled) within specimen 
across all relevant samples for each layer in V1 and ACG independently (that is, 
L2 and L3 in V1 were averaged in each brain). For each age and layer, nominal 
P values for differential regional expression were calculated using a paired test 
(n=3-4 monkeys), and genes with P < 0.05 and average fold change >2 were 
considered significantly enriched in ACG or V1 (Supplementary Table 8). We then 
defined persistent regional markers as genes significantly enriched in either ACG 
or V1 in at least two-thirds of the ages in which a layer was present (n > 5 in L2/L3; 
n> 6 in L5/L6; n > 4 in SZ and VZ). We visualized the expression patterns of select 
regional markers using line plots, where each point shows the average (+ standard 
deviation) logo-scaled gene expression levels of that gene in ACG (red) or V1 (blue) 
for a particular layer (specified to the right of the plot). 

We quantified regional enrichment of several cell types in each layer and at 
each age using a hypergeometric test. Specifically, we compared 242 lists of marker 
genes derived from the literature against each set of V1 and ACG enriched genes 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


described above, calculated a P value based on the significance of overlap using the 
phyper function in R. P values were Bonferroni corrected for multiple comparisons 
and visualized as boxes in a grid of layers (vertical axis) and ages (horizontal axis). 
Cell type lists were derived as follows: dividing cell markers were defined as any 
marker of any phase of cell cycle as presented in supplementary fig. 3 of Bar-Joseph 
et al.®°. Markers for excitatory neurons in adult mouse V1 were derived from Tasic 
et al.® by identifying all genes expressed in at least 2.2x as many excitatory as 
inhibitory cells (based on proportion) and that have expression in the 80th per- 
centile excitatory cell that is both 2.2 higher than the 80th percentile inhibitory 
cell and also higher than RPKM = 2.5. Inhibitory neuron markers were defined the 
same way, but swapping excitatory and inhibitory above. Finally, astrocyte marker 
genes were defined as genes which showed at least tenfold enrichment in sorted 
astrocytes compared with both neurons and oligodendrocytes in Cahoy et al.°” 

To address the issue of timing of gene expression in prenatal cortical develop- 

ment, we extended these analyses to S1 by considering the set of genes showing 
rostral enrichment during the phase in which V1 corticogenesis lags ACG 
(E70-E90), and asked to what extent these genes also show higher expression in 
S1 vs V1. We do this in two ways. First, we re-plot genes presented in Fig. 4 with S1 
also shown for direct comparison. Second, we calculate the average enrichment of 
these genes in S1 relative to V1 (using log, fold change across all samples from the 
relevant layers between E70-E90), and compare it directly to the average enrich- 
ment in ACG relative to V1 (Extended Data Fig. 6). 
Network profiling of disease genes. We used weighted gene co-expression 
network analysis (WGCNA)**® to generate 10 unbiased gene co-expression 
networks (one at each age) using cortical structures (ACG and V1) with the goal of 
quantifying the distribution of disease-related genes throughout these networks. 
Initial networks were generated using an automated strategy with the following 
function call in R: 

blockRun = blockwiseModules(datExprRun, checkMissingData = TRUE, 

maxBlockSize = 17500,power = 14, networkType = “signed”, deepSplit = 1, 

minModuleSize = 50, minCoreKMESize = 17, minKMEtoStay = 0.4, 

mergeCutHeight = 0.1, numericLabels = TRUE, verbose = 1) 
where datExprRun is the log, normalized microarray data from the top 9,331 (75%) 
most variable genes (at each time point). For postnatal time points these values 
in V1 and ACG were scaled to the mean interquartile expression of samples from 
layers 2, 3, 5, and 6. The goal of this scaling was to de-emphasize regional patterning 
differences or batch effects and maximize laminar and cell type patterning. After 
running the network (and after performing the merging steps described below) 
module eigengenes (ME; the first principal component of genes in the module) 
were calculated for each module, and all genes were reassigned to the module 
to which it is most highly correlated to the ME (referred to as the gene’s module 
membership, or KME). Genes with kME <0.4 for all modules in a given age are left 
unassigned (defined as module 0). While there is no agreed upon value for a KME 
cut-off in the literature, we have successfully used a cut-off of KME <0.4 in the 
past as the cut-off for assigned versus unassigned genes (for example, Hawrylycz 
et al. (ref. 70)). 

If the resulting network at any time point contained more than 15 modules, 
the module pair with the most highly correlated ME were iteratively merged until 
15 modules remained. Since networks are unaffected by changes in labelling, mod- 
ules were then re-ordered manually such that modules with strong enrichment 
for cortical plate neurons had low numbers, modules with strong enrichment for 
progenitor or glial cells had higher numbers, and modules between ages with a 
highly significant number of overlapping genes were ordered similarly across devel- 
opment. To do this third step, every pair of modules in networks from adjacent ages 
were compared using a hypergeometric test, and pairs with overlap significance of 
P< 10 *° were linked (using grey lines in Fig. 5). We note that several additional 
pairs of modules show significant overlap (but with P > 10~>°)—these modules 
pairs are not shown in the plot for readability and because we are interesting in 
finding only the most significant overlaps. Finally, modules which were split into 
two modules at a single age but not at both adjacent ages were merged. At each age, 
modules are numbered by row number in Fig. 5a with the top row being Module 
01 at each age. Module assignments for each gene at each age are provided in 
Supplementary Table 9. 

We have broadly annotated many of these modules based on cortical layer, cell 
type, and disease enrichments, and provide the annotated network as a resource 
to the community. Module assignments of postmitotic vs progenitor cell layer 
in prenatal neocortex was done by comparing mean expression in cortical plate 
(CP) and subplate (SP) vs all other layers, and noting which mean value is highest. 
Similarly, assignments of cortical vs glial cell layers in postnatal neocortex was 
done by comparing mean expression in postmitotic layers 2-6 vs white matter and 
layer 1. Module assignments based on cortical layer were performed by finding the 
average expression of all module genes in all layers included in them and finding 
the layer in which expression is highest (select layer enrichments are labelled in 
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Fig. 5a). Additional assignments for primarily postmitotic cortical layers (from E80 
to 48 months) were found by performing a hypergeometric test on every module 
against layer markers identified in the analysis in Fig. 4b for each age. Significant 
enrichments (P < 10~*°) are colour-coded in Fig. 5a. This P value cut-off ensured 
that modules selected were only enriched with markers for a single layer, whereas 
with lower cut-offs, some modules had markers for multiple layers. Module enrich- 
ments for astrocytes and oligodendrocytes were calculated by performing hyper- 
geometric test comparisons, and plotting these P values as coloured discs in Fig. 5a 
for modules that are highly significant (P < 10~'°). Astrocyte marker genes were 
defined as genes which showed at least tenfold enrichment in sorted astrocytes 
compared with both neurons and oligodendrocytes in Cahoy et al.” (likewise for 
the oligodendrocyte markers). 

We used empirical rather than Bonferroni correction of the significance of 
disease gene enrichments because the tests were correlated due to high module 
gene overlap. Hypergeometric P values were calculated by comparing each 
module against genomics-based disease genes for autosomal recessive primary 
microcephaly (13 genes)’!, autism spectrum disorder (a posteriori probability >0.8; 
171 genes)** and schizophrenia (206 genes)”* with homologues in this data set. 
Module assignments were then permuted 999 times per disease, and the minimum 
P value across all modules for each of the permuted comparisons was recorded. 
Nominal P values were corrected with this empirical null distribution of minimum 
P values. For example, a nominal P value less than the minimum P value in 90% of 
the permutations had an empirically corrected P value equal to 0.1 that controlled 
the family-wise error rate at 10%. This is a much more stringent criterion than FDR 
<0.1 because it means that there is a less than 10% chance that any of the enriched 
modules is a false positive enrichment. Modules that were significantly enriched 
for disease-associated genes were plotted as red discs in Fig. 5b-d. For each 
disease, expression heatmaps for genes present in at least two enriched modules 
were plotted to show different developmental profiles. 

A recent study of intellectual disability with and without brain malforma- 

tions in consanguineous families identified 89 known and candidate disease 
related genes’*. For each pair of 71 genes profiled in this data set, we calculated 
the Pearson correlation of expression in all ACG and V1 cortical samples across 
development. We used hierarchical clustering of the Euclidean distance of corre- 
lations to group genes with similar spatiotemporal expression profiles into 4 major 
clusters and visualized correlations as a heat map with the “pheatmap” R package. 
Genes were annotated with colour bar tracks indicating phenotypes present in one 
or more individuals with likely pathological mutations in these genes. Primary 
microcephaly was defined as microcephaly without other brain malformations 
and corpus callosum abnormalities included agenesis and various hypogenesis 
and dysgenesis phenotypes. Multivariate distance matrix regression (MDMR) 
was used to test whether each phenotype explained a significant proportion of 
variation in the expression based distance matrix, and P values were calculated 
based on a permutation test as previously described”*. Expression heat maps for 
a representative gene from each cluster were plotted to show different develop- 
mental profiles. 
Conservation of developmental expression trajectories. Developmental expres- 
sion data were downloaded for rat frontal cortex’® and five human brain regions 
(medial frontal cortex, MFC; primary visual cortex, V1; hippocampus; amygdala; 
and striatum)!° homologous to regions profiled in this study. To compare rhesus 
monkey to gross cortical samples in rat and human, we calculated average expres- 
sion across all sub-regions of anterior cingulate gyrus (ACG, partially overlapping 
with MFC in human), V1, hippocampus, amygdala and striatum at each age. We 
aligned ages across species by estimating event scores based on the conserved timing 
of developmental events! and found that prenatal and postnatal development were 
well sampled in all species. 

For frontal cortical samples, 4,125 orthologous genes were identified using 
NCBI Homologene, and we selected 2,062 genes that were among the top 50% 
most variable genes over development in rhesus. This filter removed genes that 
had relatively constant expression and therefore would be difficult to compare 
trajectories between species. Independently for each gene and species, we estimated 
the shape of expression trajectories by smoothed local linear regression (“loess” R 
function) with degree zero to allow for complex dynamics while reducing overfit- 
ting. With these smoothed fits, we estimated expression at 8 approximately evenly 
spaced event scores (ranging from 0.27 to 1.32) for each species. We correlated 
expression trajectories between each pair of species and plotted a Venn diagram 
showing the number of conserved (Pearson correlation R > 0.5) genes between 
each pair of species. Genes were assigned to one of five sets based on the pattern 
of species conservation. ‘Conserved’ genes were conserved between at least two 
species pairs and ‘not conserved’ were not conserved between any species pair 
and may represent noise. ‘Primate-specific genes were conserved between human 
and rhesus but not between rat and either primate. ‘Human-specific’ genes were 
conserved between rhesus and rat but not between human and either other species. 
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‘Rhesus-specific’ genes were conserved between human and rat but not between 
rhesus and either other species. 

Next, we downloaded developmental expression profiles from an independ- 
ent human data set (dorsolateral prefrontal cortex, dIPFC, GEO accession code 
GSE30272)!*. For 1,848 of 2,062 variable genes with dIPFC data, we calculated 
Pearson correlations of expression trajectories between human MFC and dIPFC 
as described above. Expression correlations between all pairs of species were com- 
pared with box plots (median + 25th and 75th percentiles, whiskers at 1.5 times 
the interquartile range, and outliers plotted as points). Correlation distributions 
deviated significantly from normality based on a Shapiro-Wilk test (“shapiro.test” 
function in R), and so differences were tested with a Kruskal-Wallis rank sum test 
(“kruskal.test”) followed by paired Wilcoxon signed rank tests (“wilcox.test”) to 
test for significant differences in conservation between species. Finally, we repeated 
the analysis described above for a larger set of 8,683 orthologous genes in human 
and rhesus macaque. First, we rank ordered genes based on the standard deviation 
of expression across rhesus monkey development. For the top 100 most variable 
genes, we calculated expression correlations between human data sets and between 
human and rhesus. We repeated this analysis 86 times, each time including the 
next set of 100 most variable genes. Finally, we plotted the median correlation 
within and between species as a function of developmental expression variability. 

We downloaded expression data (average ISH density) for 2,024 genes in the 
dorsal pallium (which includes frontal cortex) of developing mouse”? (http:// 
developingmouse.brain-map.org/) using the Allen Brain Atlas Application 
Programming Interface (API) and calculated Spearman correlations between 305 
mouse and rat orthologous genes. All pair-wise species correlations are included 
in Supplementary Table 10. 

We used DAVID’””’* v6.7 (https://david.ncifcrf.gov/home.jsp) to test different 

sets of conserved genes for significant enrichment in gene ontology categories, 
tissue specificity and disease association relative to a background set of all 1,848 
variable genes. Gene paralogues were downloaded from Ensembl with BioMart”. 
We checked genes for evidence of positive selection on different evolutionary 
branches using Selectome*”*! (http://selectome.unil.ch/). 
Developmental breakpoint analysis. For 4,125 orthologous genes in homolo- 
gous brain regions of human, rhesus monkey and rat, we fit expression trends 
(log-transformed intensity versus age in embryonic days) using segmented lin- 
ear regression (“segmented” R package) with a single breakpoint and two slopes. 
Breakpoints were initialized at the mid-point of development, and standard errors 
for breakpoint and slope estimates were calculated using 20 bootstrap iterations 
(Supplementary Table 11). 95% confidence intervals (CI) were calculated for slopes 
based on bootstrapped standard errors (+ 1.96 x s.e.). Genes with a good model fit 
(R? > 0.8) and significantly increasing (95% CI did not include zero) or decreasing 
initial slopes in all three species were included for further analysis. Breakpoint 
densities of log;o(age) in frontal and primary visual cortical areas were plotted 
for different species for all genes and for synapse related genes downloaded from 
SynaptomeDB® (http://metamoodics.org/SynaptomeDB). For each brain region, 
species pairwise comparisons of breakpoints and standard errors were plotted and 
genes with significantly increasing or decreasing second slopes (95% CI did not 
include zero) were highlighted (Fig. 6e and Extended Data Fig. 7d, e). For each 
pair of species, ages corresponding to equivalent (that is, equal event scores) or 
nearby developmental stages (event scores within + 0.2) were plotted. Genes with 
significantly different breakpoints between species (95% CI of breakpoints did 
not overlap) were counted and summarized for each brain region as a stacked bar 
plot indicating the expression slope after human breakpoints. Breakpoint timing 
was also compared between frontal and primary visual areas in rhesus and human 
with a paired Wilcoxon signed rank test and plotted (Extended Data Fig. 7c). In 
total, expression breakpoints were identified for 179 significantly increasing and 
179 decreasing genes in human and rhesus monkey V1 and ACG. 

Synaptic density measurements were obtained from previously published 

studies for V1 in rat®’, and V1 and prefrontal cortex in human*! and rhesus”. 
Segmented linear regression was fit to synaptic density trends (percentage max- 
imum synaptic density versus age in embryonic days), and breakpoints corre- 
sponded to ages of peak density (Extended Data Fig. 7b). Breakpoint 95% CIs were 
calculated and plotted as shaded bands in density plots. 
Code availability. All results and figures from this study can be reproduced using 
R code and data files available from GitHub (https://github.com/AllenBrainAtlas/ 
DevRhesusLMD). The software is free and can be modified and redistributed 
under the terms of the GNU General Public License, version 3 (GPLv3). 
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Extended Data Figure 1 | Anatomical parcellations of developing cortical and subcortical regions. Nissl stained sections of major brain regions 
sampled in this resource. Green lines demarcate subregions that were isolated by laser capture microdissection and transcriptionally profiled. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


Cortical Neural Progenitors: PAX6 


E40 E50 E70 E80 E90E120 Omo 3mo 12mo  48mo 


MZ|[oz] [wz]|[mz] [uz]fuz] [we]fwz] [uz lfue] [we lfnz] [ue tt ut fai ula] 
ve] [elle] [elfelefelelelefefepefeyeleleye]ele]elelfefefeyeyele 
fe 
Fo) |oneaiors a] folfe| [effelololeelelfe[elolelo[elfelo[o[e[efolfo[olelelale 
Tan TA Ia ial ia 
u u“ ra ua] us| ue 8 |u| us 88 sf | us 8 uefa] us 8 
z ra a acd a 
a0 ra a0 ra ae 
(eri colflceifcrl ceils] [ess] [es lfes] [esis] [us fis [es [us[esfis]us fee fis us fis es] us ffs] es [is fis [is [us fis]is| us| uses [is 
we [ie] elf te [ts] ce fe [ue] cof ts [ta] ce [te |e [us | co [ cof ffs [ee fus] ce | [coffe] [to [es [cs fis |fts [cow [us| ee [ew 
+e [se] se [| se [se] se] se [=| se ]| se [se] se | se [se] se 
z| (elz] [ef] eye) ele] in [ [a [el] wa 
7 aa OOOO LT OOOO NT OOOONRTOOOONS 
[sz OOLae > COLE >> COLE COLE 
BAe jeyeel~ <q <"h za <e: 
J] a] a] s 
5 
go 
fale g2 8a 
O Tr Orr Orr Orr Orr Ore o 
O > OF GOS GOS GHS Guns 
< 9 < zt zt < 


Young Postmitotic Cells: DCX 


E40 E50 E70 E80 E90E120 Omo 3mo 12mo 48mo 


zie ele i a a lle 
efelefeyelele|efefefelelfelefefe[e[elfefefefelele 
SOSUSS SSSSOo SSShoo SoeEoo 
ca ma aa 
Pay fed ey caf | ae sa] of | a] oe 
re rt 
ri rc 


IZ El iM 
: Eo 
|s2 
zo] vol vz vz vhs 
vz] [vif vz ea val val va 
8 5 8aS Sas 
<< << 
E40 E50 E70 E80 E90E120 O0mo 3mo 12mo 48mo 
[mz [fue] [wee] [nz ffz] [oc focz] [oe fone] [ie i] (al 
| fella] [elfefefepelelelfelele|ele|elfejee[e[elelfelelelelele 
[cPe| Bi Ree ORS CeO SSCdcS SSCcne Cocoos Deo 
re rm re aa ia 
u u ag tafe] ES uff ES ull usu] ES 
rs re ra 
[cri] cull ceifor cei S| [uses] [esffes] [uses] [us ffes]es[eses]es [us fuses us| es] uses fuses [is |is|us [uff is] us [es] es] us [us IB] 
Oo OOO Soo O00 | oo ces wl a | wie] 
Soo Goo ceo Goo oes 
z] (elle) tele) lel ele a @ fal jwu 
ODVONTOOOONTOOOCONT YOOOONRS 
scold soll s2olzel zo] azoloed sz OOLeE >> OOLL>>OOLE>=> OOLE>> 
Ele sz sz|sz] sz iq <a <2 qa 
szi|szs| s2i}| szisa] szx sai sz Ts a] no] no] 
a zal zac zl vz Sace che - ‘ati 
q 
vas] [veal vail veil veel vasfvail ves 8 a5e58 


8 S Bas Bas Sa 
t° <4 <a < 


v1 
ACG 
S1 
vi 
ACG 
S1 
v1 


Extended Data Figure 2 | Canonical cell type marker gene expression 
across cortical development. Heat maps of average gene expression 

in cortical layers at different prenatal (E40-E120) and postnatal ages 
(0-48 months). Anterior cingulate gyrus (ACG) and primary visual 
cortex (V1) were sampled at all ages, while primary somatosensory cortex 
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Oligodendrocytes: MOBP 


E40 E50 E70 E80 E90E120 Omo 3mo 12mo 48mo 


MZfuz] [wz|fvz] [rene] [we wziifwz] [uzlfac] fw u u ult 
@] le] PPP eP Pee eee ele eee rr eerrerig 
leo] |crofficro] f° 
| [ete] folelelelstale olelelelelelfelelelelale 
aa a i , 
ra a = 
colealcelal=] [ele] felts] fell] fslfspetelsts Seogos Geeoos 
te [ts] ue ff ww [ez] co ff cw [es] ce ff ve [vs] ve ff vw 6 |G] 
z z] tee] [elle] (ele) fe | Bl Bw) 
DOONT VOOONTOOOONT OOOONS 
sez zs so Oba >> ook e-> ooke-> ooke>> 
sz} |s2l) sz|se| se sz| |sz a. qa qa <f 
‘s2i|szi] sail szi|s2i| sz sz |sz] szx Ts ao] no] cs 
vze| {zal vzohzl val) vzohze vz 
vz|ve| vel vz|vzl vzlflvz| | vz 2ckRm 
hai] [val valve ve va ve va xe 
© FT Orr OTE O o'r 
09> 6 


& gas 

<< < < 
(S1) was sampled in a limited set of layers prenatally. Several additional 
prefrontal and visual areas were sampled postnatally: orbital gyrus (OG), 
dorsolateral prefrontal cortex (dIPFC), rectal gyrus (RG) and secondary 
visual cortex (V2). 
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Extended Data Figure 3 | Sex differential expression is limited to two Y 
chromosome genes. a, Quantile—quantile plot of observed versus expected 
sex differential expression for all 12,441 genes across brain regions during 
prenatal development. A linear mixed model was fit to all prenatal brain 
samples with a fixed effect for sex and random effects for brain region, 
age and donor (see Supplementary Table 11). Genes were ordered by the 
observed sex effect and plotted versus the expected sex effect based on 
permutation testing. A 95% confidence interval was calculated (dashed 
line) based on permutations, and 11 genes in males and no genes in 
females were more highly expressed than expected by chance. Seven of 
these 11 genes were nominally significant and included at least two Y 
chromosome genes (EIFIAY, LOC720563) and potentially a third gene 
(LOC693361) whose microarray probe maps to an unannotated region of 
the Y chromosome. 
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Enrichment for autophagy of the most dynamically increasing (upper 
triangle) and decreasing (lower triangle) genes with samples ordered 

and labelled as in Fig. 3b. For each gene list, the colour corresponds to 

the proportion of GO terms that are more specific (that is, ‘child’) terms 
subsumed under autophagy (GO:0006914) based on the GO hierarchy 

and that are significantly enriched (nominal P < 0.05). Note that 
autophagy was selectively turned on in occipital cortex after infancy and in 
hippocampus after juvenility (arrows). 


Extended Data Figure 4 | Expression rates of change have similar 
developmental trajectories across all brain regions. a, Rates of 
expression change in all available brain subregions and ages. b, Box plots 
summarizing the number of significantly increasing or decreasing genes 
between adjacent time points in all subregions. At all ages, the majority 

of subregions had at least 1,000 genes (red line) that were significantly 
changing (ANOVA FDR < 0.05, fold change >1.25). c, Regional specificity 
of increased autophagy may reflect differential timing of synaptic pruning. 
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Extended Data Figure 5 | Variable synchrony of biological processes were greater than expected by chance (black, dotted line). c, Rank ordered 
between brain regions. a, Example of variable timing of GO process timing of GO processes in Fig. 4c with weighted average rank for each 
activity (black boxes) between regions, resulting in different age overlaps region (asterisks). d, Developmental expression of mature oligodendrocyte 
(mini-table below). Note that E50 was the earliest age for which we markers in four myelin-enriched brain subregions with early increased 
could calculate expression change. AM, amygdala; BG, basal ganglia; HP, expression in globus pallidus (arrows) for MOG and ERMN, but not MAL 
hippocampus; NCX, neocortex. b, Average pairwise age overlaps (black, and ASPA. 
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Extended Data Figure 6 | Neurogenesis and gliogenesis in S1 occur at a 
time course intermediate between V1 and ACG. a, Genes with enriched 
expression in ACG relative to V1 between E70-E90 also show enriched 
expression in S1 relative to V1, suggesting that the timing of primary 
sensory regions is non-uniform in L5/L6 (top), subventricular zone (SZ) 
(middle), and ventricular zone (VZ) (bottom). Each plot shows the average 
enrichment (log, fold change) in S1 vs V1 (y axis) compared with the 
average enrichment in ACG vs V1 (x axis) between E70-E90 for all genes 
significantly enriched in ACG in at least two of the three ages between 
E70-E90 (Fig. 5a). b, Marker genes for cell types (compare with Fig. 5e-h) 
show expression patterns in $1 which are either consistent with ACG 
(that is, GAD1) or intermediate between V1 and ACG (that is, AQP4). 

c, Genes with enriched expression for V1 (FGFR3) or ACG (CBLN2) 
across development show intermediate expression in S1, suggesting that 
these genes may show cortical gradients rather than specific expression in 
V1 or ACG. d, Genes with common expression patterning in V1 and ACG 
can show different patterns in S1, suggesting that rate of neuron and glia 
development is not the whole story. For example, SPARCL1 and ABCA8 
both show increased expression with time in VZ with V1 showing a delay 
relative to ACG; however, these two genes show different temporal delays 
in S1. 
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Extended Data Figure 7 | Evolutionary conservation of developmental 
expression. a, Median pairwise correlations of expression trajectories 

in prefrontal cortex within human (black) and between human and 
rhesus monkey (red) decrease for less variable genes in rhesus monkey 
(genes ordered by standard deviation of expression across ages). b, Left: 
proportion of genes assigned to different conservation categories is robust 
to correlation threshold. Right: developmentally dynamic genes are more 
highly conserved (genes ordered by same method as in a). For each set of 
genes, the average + standard deviation of the proportion of genes in each 
conservation category was estimated using correlation thresholds ranging 
from 0 to 0.9. c, Left: segmented linear fits of expression for example 

gene with estimated breakpoints in each species (dashed lines). Right: 
distribution of breakpoint ages for 179 decreasing genes with good fits to 
the model in frontal and primary visual cortex. Colours and symbols are 
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previously published studies (see Methods). e, Breakpoint comparison 

of 179 increasing genes including 81 synapse related genes (red) between 
cortical areas within species. Genes that fall on the lines peak at the same 
age in primary visual and prefrontal cortex. f, g, Comparison of breakpoint 
timing between human, rhesus monkey and rat in prefrontal cortex (f) and 
additional brain regions (g). Genes that plateau in expression after their 
breakpoint in human (grey points), and genes that significantly decrease 
(blue symbols) or increase (red symbols) expression with 95% confidence 
intervals (grey lines) of breakpoints. Black lines correspond to equal 
(solid) -- a window (dashed) of developmental ages between species. 
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Extended Data Table 1 | Rhesus monkey donor information 


Indian Chinese 

origin origin 
Animal ID Age SX date Sex proportion proportion 
MMU37857 E40 2/9/2012 Female 15/16 1/16 
MMU38481 E40 5/2/2012 Male 1 0 
MMU31370 E40 3/26/2013 Male 1 0 
MMU36477 E40 4/16/2012 Female 1 0) 
DAM37624 £50 1/13/2012 Male 5/8 3/8 
MMU36725 E50 4/23/2012 Male 1 0 
MMU30213 E50 3/6/2012 Male 1 0 
MMU38099 E50 2/23/2012 Female 15/16 1/16 
DAM35650_ ~=E70 1/23/2012 Female 1 0 
MMU32143 E70 3/9/2012 Female 1 0 
MMU33757 E70 3/5/2012 Male 15/16 1/16 
MMU35475 E70 3/5/2012 Male 7/8 1/8 
MMU35058 E80 2/2/2012 Female 15/16 1/16 
MMU37852 E80 3/29/2012 Female 1 0 
MMU34624 E80 4/19/2012 Male 3/4 1/4 
MMU36435 E80 4/30/2012 Male 15/16 1/16 
MMU34326 E90 4/16/2012 Male 1 0 
MMU35674 E90 4/13/2012 Female 1 0 
MMU38285 E90 4/13/2012 Female 13/16 3/16 
MMU38521 E90 4/27/2012 Male 1 0 
MMU27052 =E120 = 3/23/2012 Male 1 0 
MMU30021 E120 3/7/2012 Female 1 0 
MMU34501 =£120 = 7/30/2012 Male 3/4 1/4 
MMU29380 E120 3/12/2012 Female 1 0 
MMU39867 OM NA Male NA NA 
MMU39893 OM 4/12/2013 Male 3/4 1/4 
MMU40864 OM 4/21/2014 Male 1 0 
MMU40944. OM 4/29/2014 Male 1 0 
MMU39465 3M 11/13/2008 Male 1 0 
MMU39538 3M 5/10/2013 Male 1 0 
MMU39766 3M 7/15/2013 Male 7/8 1/8 
MMU40525 =3M 6/9/2014 Male 1 0 
MMU39420 12M _ ~— 6/26/2013 Male 1 0 
MMU39620 12M — 3/9/2014 Male 7/8 1/8 
MMU39645 12M = 3/9/2014 Male 7/8 1/8 
MMU36322 48M — 7/11/2013 Male 1 0 
MMU36358 48M — 7/14/2013 Male 5/8 3/8 
MMU36468 48M 6/26/2013 Male 5/8 3/8 


Subject information for 38 rhesus macaque monkeys included in study. Surgery (SX) date, age, sex, and ancestry based on origin of parents and grandparents. 
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Human gut microbes impact host serum 
metabolome and insulin sensitivity 


Helle Krogh Pedersen!*, Valborg Gudmundsdottir!*, Henrik Bjorn Nielsen!*, Tuulia Hyotylainen?*;**, Trine Nielsen™*, 
Benjamin A. H. Jensen®, Kristoffer Forslund’, Falk Hildebrand’*’, Edi Prifti!°", Gwen Falony??, Emmanuelle 

Le Chatelier!, Florence Levenez!”, Joel Doré!-, Ismo Mattila**, Damian R. Plichta!, Paivi Poho*", Lars I. Hellgren!, 
Manimozhiyan Arumugam’, Shinichi Sunagawa”!®, Sara Vieira-Silva?’, Torben Jorgensen!”!®, Jacob Bak Holm®, 
Kajetan Trost!*, MetaHIT Consortium}, Karsten Kristiansen®!°, Susanne Brix!, Jeroen Raes*®-®!?, Jun Wang®!9.20.21,22, 
Torben Hansen>*, Peer Bork’?4?>?6, Soren Brunak!?’, Matej Oresic*:*4, S. Dusko Ehrlich'?8 & Oluf Pedersen>” 


Insulin resistance is a forerunner state of ischaemic cardiovascular disease and type 2 diabetes. Here we show how the 
human gut microbiome impacts the serum metabolome and associates with insulin resistance in 277 non-diabetic Danish 
individuals. The serum metabolome of insulin-resistant individuals is characterized by increased levels of branched-chain 
amino acids (BCAAs), which correlate with a gut microbiome that has an enriched biosynthetic potential for BCAAs and is 
deprived of genes encoding bacterial inward transporters for these amino acids. Prevotella copriand Bacteroides vulgatus 
are identified as the main species driving the association between biosynthesis of BCAAs and insulin resistance, and in 
mice we demonstrate that P. coprican induce insulin resistance, aggravate glucose intolerance and augment circulating 
levels of BCAAs. Our findings suggest that microbial targets may have the potential to diminish insulin resistance and 
reduce the incidence of common metabolic and cardiovascular disorders. 


Insulin resistance (IR) and metabolic syndrome are risk factors for 
both type 2 diabetes and ischaemic cardiovascular diseases, pathologies 
that are in epidemic growth worldwide. Mounting evidence suggests 
a link between the gut microbiome and human metabolic health!4, 
with transferability of insulin resistance phenotypes through faecal 
microbiome transplants”®. These effects may partly be mediated 
through the metabolome’. Serum levels of amino acids, most 
consistently the BCAAs®”, triacylglycerols with low carbon number 
and double bonds!™"', as well as specific membrane phospholipids”, 
have previously been associated with IR and future risk of metabolic 
and cardiovascular morbidities. However, the origin of the abnormal 
IR-associated serum metabolome is largely unknown”. 

To explore the relationships between the fasting serum metabolome 
and the gut microbiome in the states of IR and metabolic syndrome, 
we examined 291 non-diabetic Danish adults, of whom 277 had 
available gut microbial data from the MetaHIT! study population. 
IR was estimated by homeostatic model assessment (HOMA-IR), 
which in epidemiological studies is a widely applied measure of IR, 
primarily as an estimate of hepatic IR". Since IR is largely influenced 
by body mass index (BMI), we also estimated the IR index adjusted for 
BMI (HOMA-IRpMiaqj). Metabolic syndrome was defined according 


to recommendations by the International Diabetes Federation’? 
(see Methods). For comparative analyses we further included 
75 Danish type 2 diabetes patients* with quantitative gut metagenomics 
data generated by the same experimental protocol. Characteristics of 
the study samples are given in Supplementary Table 1 and Extended 
Data Fig. 1. 


Results 

In the present study, we assessed the role of the gut microbiome as a 
source for key features of the serum metabolome profile, predicting 
metabolic and cardiovascular disorders in non-diabetic lean and obese 
people. Untargeted metabolome profiles were generated on fasting 
serum samples, applying two mass spectrometry-based analytical 
platforms and providing information about 325 polar metabolites 
(94 known, 231 unknown) and 876 molecular lipids (289 known, 
587 unknown), respectively (here collectively termed serum 
metabolites). These were binned into 74 co-abundance clusters across 
all individuals (Fig. 1a, Supplementary Tables 2 and 3). We found 19 of 
the 74 metabolite clusters (26%), comprising 26 polar metabolites and 
367 lipids, to be significantly associated with both IR and metabolic 
syndrome, with consistent directionality across the 291 non-diabetic 
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Figure 1 | Overview of the workflow integrating human phenotypes, 
fasting serum metabolome, gut microbiome data and mouse feeding 
experiments. a, Metabolites were summarized as co-abundance clusters, 
and functional module and species abundance profiles extracted from 

gut microbiome data. b, Features were filtered for significant positive 

or negative associations with HOMA-IR, HOMA-IRgmitaqj or metabolic 
syndrome. c, Metabolite clusters were divided into IR- and IS-metabotypes 
and associated with microbiome functional modules. d, Microbial driver 
species for the functional module associations with HOMA-IR were 
identified using a leave-one-out analysis. e, Experimental design for 

P. copri feeding mouse experiment. HED, high-fat diet; OGTT, oral glucose 
tolerance test; ITT, insulin tolerance test; MetS, metabolic syndrome. 


individuals (Spearman rank correlation test, false discovery rate 
(FDR) < 0.1, Extended Data Fig. 2 and Supplementary Tables 4 and 5). 
These 19 metabolite clusters separated into two groups that were 
either positively or negatively correlated with both IR and metabolic 
syndrome, referred hereafter to as the [R-metabotype and insulin 
sensitivity (IS)-metabotype, respectively (Fig. 1b, Supplementary 
Table 6, Supplementary Results and Extended Data Fig. 3). Most 
of these associations were confirmed in type 2 diabetes patients 
(see Supplementary Results for details). 

In accordance with previous reports, the identifiable serum 
metabolites constituting the IR-metabotype mainly comprised 
amino acids (including BCAAs; see Supplementary Discussion), 
tricarboxylic acid cycle metabolites, and a large number of tria- 
cylglycerols (Supplementary Table 6, previous findings referenced 
therein). Interestingly, we found that serum BCAAs clustered with 
hydrocinnamic and indole-3-lactic acids (metabolite cluster M10, 
Supplementary Table 2), both of which belong to a class of phenolic 
compounds that are associated with gut microbial metabolism'®. Serum 
levels of hydrocinnamic acid are affected by gut microbiota in mice'” 
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and hydrocinnamic acid is a known inhibitor of branched-chain c- 
keto acid dehydrogenase kinase, which regulates the breakdown of 
BCAAs!®. By contrast, the IS-metabotype consisted solely of serum 
lipids, primarily phospholipids and triacylglycerols with odd carbon 
number and high double bond content (Supplementary Table 6). 
Hence, our results confirm previous reports of odd-chain fatty acids 
and polyunsaturated fatty acids as associated with reduced risk of type 
2 diabetes'®!”. Plasma odd-chain fatty acids are often considered to bea 
product of microbial metabolism or a marker of dietary intake, especially 
of dairy products, although recent findings show that C15:0 and C17:0 
are also endogenously produced in adipocytes through a-oxidation 
of palmitic and stearic acid?! Based on our data, however, it is not 
possible to distinguish if these odd-chain fatty acids originate from 
dietary intake or bacterial metabolism. Surprisingly, the IS-metabotype 
also included C16:0-ceramide and sphingomyelin. Elevated circulat- 
ing concentrations of ceramides”’, including the C16:0 ceramide?>4, 
are generally implicated in the pathogenesis of IR. However, our data 
suggest that C16:0-ceramide may play a more complex role in the 
regulation of host insulin sensitivity than recognized until now. 

Some of the IR- and metabolic-syndrome-associated fasting serum 
metabolites, such as the essential BCAAs, cannot be synthesized 
by humans and must therefore originate from ingested food or gut 
microbial synthesis. Therefore, we investigated the metabolic potential 
of the gut microbiome in relation to these metabolites using KEGG 
functional modules (Fig. la), that is, a set of manually curated 
functional and pathway units from the KEGG database, each consisting 
of a number of KEGG orthologous gene groups”. 

Forty-one of the 567 microbiome functional modules (7%) were 
significantly associated (Mann-Whitney U-test, FDR < 0.1) with one 
or more of the IR and metabolic syndrome phenotypes (Extended 
Data Fig. 2, Supplementary Table 7). All 41 functional modules were 
furthermore associated with the IR- and IS-metabotypes (Figs 1c and 2), 
with a majority also differing in abundance in the expected direc- 
tion in type 2 diabetes patients (Supplementary Results). A majority 
(23/33) of the HOMA-IR associated modules remained significant after 
adjusting for BMI, while six additional modules were identified upon 
the BMI adjustment (Extended Data Fig. 2, Supplementary Table 7). 
The results suggest that, in this sample of lean and obese individuals, the 
functional microbial shift in IR is largely independent of BMI, which 
should be further validated in population-based cohorts. Furthermore, 
we found this functional shift to also partly overlap with functional 
characteristics of a low richness gut microbiome, as the IR associations 
of many functional modules (24/33) were attenuated after adjusting for 
microbial gene richness (Supplementary Results). We observed only a 
minor separation in the microbiome or metabolomic data correspond- 
ing to patient subgroups based on the five clinical components of the 
metabolic syndrome (Supplementary Results). 

The cross-domain associations between the IR and metabolic syn- 
drome phenotypes, the serum metabolome and the gut microbiome 
described above may suggest functional relationships. Notably, the 
functional modules of the microbiome that were positively associated 
with both HOMA-IR and the IR-metabotype contained enzymes for 
biosynthesis of BCAAs, cofactors, vitamins, lipopolysaccharides and 
various transport systems, some of which have elevated expression 
in gut microbiomes transplanted into mice from obese donors°®. In 
contrast, the microbiome negatively associated with HOMA-IR and 
the IR-metabotype contained functional modules important for 
methanogenesis, pyruvate oxidation and transport systems, including 
inward transport of BCAAs. Noticeably, the microbiome functional 
modules that were associated with the two metabotypes showed a large 
overlap with the metabolic pathways essential for microbiome-host 
interactions mediated through metabolites, including the BCAAs, as 
highlighted in a recent Caenorhabditis elegans study”°. 

Coherently, the majority (60/81) of the microbial species associated 
with IR or metabolic syndrome (Supplementary Table 8) correlated 
significantly with the IR- and IS-metabotypes (Supplementary Results, 
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Extended Data Figs 4 and 5). To identify the species that contributed the 
most to the associations between the microbiome functional modules 
and HOMA-IR, we iterated the correlation analysis, leaving out 
genes from a single species in each iteration (see Fig. 1d and Methods 
for details). For several functional modules, the correlations with 
HOMA-IR were driven by only a few species (Supplementary Table 9). 
Notably, the positive associations between HOMA-IR and the biosyn- 
thesis modules of BCAA, tryptophan and lipopolysaccharides were 
largely driven by P. copri followed by B. vulgatus, as they were almost 
abolished when genes from these species were omitted. Although these 
two species were the main drivers for many of the same functional 
modules, they were at the same time significantly anti-correlated 
(Spearman correlation coefficient = —0.38, P=9.0 x 107 "', Extended 
Data Fig. 6), consistent with the previously reported trade-off 
between these genera”’. Still, the abundance of each of these species 
(when present) correlated positively with HOMA-IR (Extended Data 
Fig. 7, Supplementary Table 10), as did their combined abundance 
across all individuals (Spearman correlation coefficient = 0.20, 
P=0.001, Extended Data Fig. 6). The possible importance of P. copri 
is further highlighted by its positive correlation with the BCAA- 
containing metabolite cluster M10 in the 94 individuals with detectable 
levels of the species (Spearman correlation coefficient = 0.22, P=0.034, 
Extended Data Fig. 7, Supplementary Table 10). However, the corre- 
lations of P. copri to the BCAA-containing metabolite cluster M10 
and HOMA-IR were abolished in the full cohort of 277 non-diabetic 


IB Negative association 
i Positive association 
No significant association 


-0.2 0 0.2 


Leucine biosynthesis 
Isoleucine biosynthesis module1 


individuals (P=0.44 and P= 0.67, respectively), suggesting that in 
the absence of P. copri other species may influence the levels of the 
M10 metabolite cluster and HOMA-IR. P. copri, a hallmark species 
of the Prevotella enterotype”® (see Supplementary Results for further 
discussion), has been associated with rheumatoid arthritis”? but not 
with IR in humans. B. vulgatus is reported to correlate positively with 
fat mass and other markers of IR in humans*”, whereas a mouse study 
suggested protection against the development of metabolic disorders, 
although only in the context of a low-fat diet®. Thus, the role of 
B. vulgatus in IR and metabolic disorders seems complex and is proba- 
bly context-specific. Among other driver species for the microbial func- 
tional modules positively correlated with HOMA-IR were Escherichia 
coli, Sutterella wadsworthensis and unclassified Sutterella and Prevotella 
species (Extended Data Fig. 8, Supplementary Table 9). 

In contrast to the functional modules enriched in the gut microbi- 
ome of insulin-resistant individuals, which were driven by relatively 
few microbial species each with high impact, all microbial functional 
modules associated with increased insulin sensitivity were driven by 
multiple species, all with minor effect (Supplementary Results). 

The microbial functional module analysis presented in Fig. 2 high- 
lighted an increased BCAA biosynthesis potential, but depletion 
for genes encoding the transport system for bacterial BCAA uptake 
(referred to as inward BCAA transport) in the gut microbiome of 
insulin-resistant individuals (Fig. 3a). These findings synergistically 
contribute to an increased BCAA pool and are of considerable interest 
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Figure 2 | Association map of the three-tiered analyses integrating the 
phenome, the gut microbiome and the fasting serum metabolome in 
277 non-diabetic individuals with available metagenomic data. The left 
panel shows significant associations (Mann-Whitney U-test FDR < 0.1) 
between microbial functional modules and the indicated phenotypes; 
colouring indicates direction of association. The right panel shows 
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associations between the same modules and serum metabolite clusters. 
Colouring represents the median Spearman correlation coefficient 
between metabolite clusters and the indicated functional modules, 
corrected for background distribution (SCCp¢ aq, see Methods), where 
MWU EDRs are denoted: +, FDR < 0.1; *, FDR < 0.01; **, FDR < 0.001. 
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Figure 3 | The ratio between the gut microbiome potential for BCAA 
biosynthesis and inward transport is linked to fasting serum BCAA 
levels and IR in humans and can be attributed to a few driver species, 
including P. copri that induces an aggravation of glucose intolerance 
in mice concomitantly with elevated serum BCAA levels. a, The 

total abundance of all microbial species containing genes coding for 
BCAA biosynthesis (Spearman correlation coefficient (SCC) = 0.30, 
P=5.3 x 10-”) and inward transport (SCC = —0.17, P=4.5 x 10~°) 
potential is shown in green and blue, respectively, for 277 non-diabetic 
individuals ordered by their HOMA-IR levels. The slopes are significantly 
different (P= 4.0 x 108, see Methods). b, Fasting serum BCAA levels 

in 291 non-diabetic individuals ordered by HOMA-IR levels 

(SCC: leucine = 0.40, isoleucine = 0.44, valine = 0.49, P< 4 x 1071”). 

c, d, The effect of specific microbial species on associations between 
BCAA biosynthesis and transport, respectively, and HOMA-IR in 277 
non-diabetic individuals; illustrated by the change in background-adjusted 
median SCC (SCCp¢.agj) between HOMA-IR and the BCAA biosynthesis/ 


in the light of the report demonstrating that serum BCAA levels are 
increased in mice following transplantation of stools from insulin- 
resistant individuals®. The described gut microbiome features are 
consistent with the elevated serum levels of BCAAs in insulin-resistant 
individuals (Fig. 3b), suggesting that an increased ratio between gut 
microbial biosynthesis and uptake of BCAAs contributes to elevated 
serum BCAA concentrations, which are known to associate with IR. 
Moreover, the difference between the potential for BCAA biosynthesis 
and inward BCAA transport of the gut microbiome was even more 


Minutes post insulin injection 


P.copri abundance (a.u.) 


transport functional modules when a given species has been excluded 
from the analysis (see Methods). e, Suggested model of the microbiome 
contribution to serum BCAA levels and IR. Fasting serum BCAA levels 
are influenced by microbial BCAA biosynthesis and uptake; these levels 
in turn may influence host insulin sensitivity. f, Changes in mouse serum 
BCAA levels (mean + s.e.m.) after P. copri or sham gavaging for two weeks 
before challenge. Asterisks indicate significance between P. copri-gavaged 
(n= 12) and sham-gavaged control mice (m= 12) and significance 
relative to before challenge (likelihood ratio test, *P < 0.05; **P<0.01). 
g, Insulin tolerance test. The P. copri-gavaged mice (n = 12) had 
significantly higher serum glucose levels compared to sham-gavaged 
controls (1 = 12, P=0.032, repeated measures two-way ANOVA) after 
three weeks challenge. Mean +s.e.m. is depicted. Asterisks indicate 
significant differences at individual time points (repeated measures 
two-way ANOVA): *P < 0.05; **P < 0.01. h, Faecal P. copri abundance 
(arbitrary units, quantitative PCR normalized 16S rDNA) as a function of 
HOMaA-IR two weeks post bacterial challenge (SCC = 0.46, P= 0.040). 


significantly associated with the serum BCAA concentrations than 
either of the two functional modules alone (Supplementary Table 11). 
Importantly, while the IR-associated increase in the microbial potential 
for BCAA biosynthesis was largely driven by P. copri or B. vulgatus 
(see above), the reduced capacity for bacterial BCAA uptake was driven 
by reduced abundance of multiple species, including B. crossotus and 
E. siraeum (Fig. 3c, d). 

Consistent correlations with HOMA-IR were observed for almost 
all of the enzymes and transport system subunits within the BCAA 
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biosynthesis and transport modules respectively, lending further 
credence to the importance of these modules (Extended Data Fig. 9, 
Supplementary Table 12). Notably, of the BCAA biosynthesis enzymes, 
p-citramalate synthase showed the strongest individual positive 
correlation with HOMA-IR (Spearman correlation coefficient = 0.24, 
P=4.89 x 107°). p-citramalate synthase constitutes the first step 
in a less common threonine-independent isoleucine biosynthesis 
pathway**. While most microorganisms synthesize isoleucine from 
threonine*’, 87 gut microbial species identified in our cohort (including 
both P copri and B. vulgatus, Supplementary Table 13), contained the 
p-citramalate synthase gene, highlighting the importance of this less 
common pathway in the human gut microbiome. 

Our findings are summarized in the model shown in Fig. 3e, which 
offers a mechanistic explanation for the increased circulating BCAA 
concentrations that are known to correlate positively with IR*! and 
type 2 diabetes®. The difference in functional potential between 
microbial BCAA biosynthesis and BCAA uptake explained 2.4-3.1% 
of the variance in serum BCAA levels (Supplementary Table 11). 
In comparison, less than 1% of variance is explained by currently 
identified variants in the human genome, although the heritability of 
fasting serum BCAA levels** is estimated to be 0.41-0.49. 

P. copri was the strongest driver species for the positive association 
between microbial BCAA biosynthesis in the gut and IR, suggesting 
a possible causal relation. This assumption is further supported by 
several reports associating Prevotellaceae***’, and more specifically 
P copri??®, with intestinal inflammation, an important contributor 
to host IR. To experimentally address the relation between P. copri 
and abnormal glucose metabolism, we compared P copri (n= 12) 
and sham-gavaged (n = 12) C57BL/6J male mice on a high-fat diet 
(Fig. le). Notably, two weeks of P. copri challenge aggravated glucose 
intolerance (Extended Data Fig. 10a) and increased serum total BCAA 
levels (compared with baseline: likelihood ratio test, P= 0.004, Fig. 3f). 
Insulin sensitivity was reduced after three weeks of P. copri challenge 
(Fig. 3g, P= 0.032), suggesting a potentially causal role of P. copri in 
mediating the increase in serum BCAA and IR. The relatively modest 
effect of the P copri inoculation on serum BCAA levels may reflect that 
P. copri correlated more strongly to HOMA-IR than to BCAAs in the 
human cohort (Supplementary Table 10). Importantly, upon glucose 
challenge, the plasma insulin excursion rates were indistinguishable 
between groups (Extended Data Fig. 10b; P=0.80), hence negating 
beta cell malfunction as a confounding variable. It should be noted 
that Prevotellaceae (and thus P copri) were a minor component of the 
mouse microbiome in both groups upon feeding with a high-fat diet, 
as evidenced by faecal 16S rDNA amplicon analysis (Extended Data 
Fig. 10c). Despite minor compositional changes in the gut microbiota 
(adonis, P=0.04), we found no alterations of the relative abundances 
of any single family component thereof to be significant upon gavage 
(Extended Data Fig. 10c, FDR > 0.05). In contrast, P. copri was sig- 
nificantly increased (Extended Data Fig. 10d, P=0.0058), suggesting 
that it contributed to the observed pathophysiological changes. This 
conclusion is further supported by the direct correlation between 
P. copri abundances and HOMA-IR (Fig. 3h, Spearman correlation 
coefficient = 0.46, P= 0.040). Interestingly, an inverse relationship 
between glucose intolerance and P. copri challenge was recently 
reported*’. However, in the latter study a completely different dietary 
regimen, high in fibre and low in fat, was used, suggesting that the effect 
of P. copri is diet-dependent. Furthermore, it should be noted that the 
feed used in the present study contained soy protein, which is low in 
BCAAs, as the protein source, contrasting the use of casein, high in 
BCAAs, in the study by Kovatcheva-Datchary et al.*”. 


Discussion 

By integrating data on host insulin sensitivity and metabolic syndrome, 
gut microbiome, and fasting serum metabolome, we were able to 
demonstrate clear metabolome signatures of IR phenotypes among 
non-diabetic individuals and validate them in patients with type 2 
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diabetes. The IR-associated metabolome associates with functional 
components of the IR-linked gut microbiome: notably the increased 
potential for lipopolysaccharide and BCAA biosynthesis, and reduced 
potential for BCAA transport into bacterial cells, methanogenesis and 
pyruvate oxidation. 

Studies demonstrating the association of fasting serum BCAA levels 
with incident type 2 diabetes®, the normalization of BCAA levels in 
obese individuals after bariatric surgery“ and the development of IR 
in rats after BCAA diet supplementation”! suggest a potentially causa- 
tive role of the BCAAs, or their breakdown products*!?, in metabolic 
disorders, although the reason as to why they are elevated in the first 
place is not well established. Potential explanations include reduced 
BCAA breakdown in adipose tissue*** and liver*’. Consistent relation- 
ships in our data between serum concentrations of BCAAs, bacterial 
BCAA biosynthesis and inward transport potentials, and the severity 
of IR phenotypes are reinforced by the intriguing findings that BCAAs 
co-vary tightly with fasting serum metabolites known to be of microbial 
origin. This suggests that the gut microbiota is another independent 
contributing source of elevated serum BCAA levels in common human 
states of IR. Furthermore, positive correlations between microbial 
functions—including BCAA biosynthesis—and IR are largely driven 
by a few species only, notably P. copri and B. vulgatus, suggesting that 
they may directly impact host metabolism. We tested this hypothesis 
in mice fed a high-fat diet, and found that a challenge with P copri led 
to increased circulating serum levels of BCAAs, insulin resistance and 
an aggravation of glucose intolerance. We conclude that dysbiosis of the 
human gut microbiota impacts the serum metabolome and contributes 
to insulin resistance. 

Importantly, while our findings relate to non-diabetic individuals 
and type 2 diabetes patients with preserved insulin secretion, they 
cannot yet be generalized to patients with impaired insulin secretion 
(Supplementary Discussion). Future physiological studies are needed 
to elucidate how the intestinal BCAAs and other amino acids enter the 
bloodstream and from which intestinal location they are absorbed. 
Furthermore, investigations of how dietary changes alone or in com- 
bination with microbial or pharmacological interventions may impact 
the microbiome and, in particular, influence P. copri modulation of 
serum BCAA levels will open novel avenues to counter the pathogen- 
esis of IR and its linked epidemics of common metabolic and cardio- 
vascular disorders. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Human study population. The study population comprised 291 normoglycaemic 
and middle-aged Danish MetaHIT individuals as previously reported’, as well 
as 75 Danish type 2 diabetes patients‘. All analyses involving gut microbiome 
data were restricted to 75 type 2 diabetes patients and 277 out of the 291 of the 
non-diabetic individuals whose data passed all quality control criteria (see details 
below), whereas all non-microbiome analyses involved the full set of 291 non- 
diabetic individuals. No statistical methods were used to predetermine sample size. 

Of the 75 type 2 diabetes patients, ten patients (13%) received no hyperglycae- 
mic medications, 58 patients (77%) received the biguanide compound, metformin; 
of these 28 patients (37%) received metformin as the only anti-hyperglycaemic 
medication, 10 patients (13%) received sulfonylurea alone or in combination 
with metformin, 14 patients (19%) received a combination of oral anti-diabetic 
drugs and insulin treatment and 4 patients (5%) were on insulin treatment 
only. Eleven patients (15%) were treated with dipeptidyl peptidase-4 (DPP4) 
inhibitors or glucagon-like peptide-1 (GLP-1), all of them in combination with 
metformin. Patients were reported as receiving anti-hypertensive treatment if at 
least one of the following drugs were reported: spironolactone, thiazides, loop 
diuretics, beta-blockers, calcium channel blockers, moxonidin or drugs affect- 
ing the renin-angiotensin system (in total n = 55; 73%). Patients receiving statins 
were reported as receiving lipid-lowering medication (in total n= 56; 75%). We 
have previously shown that metformin has impact on the configuration and 
functional potential of human gut microbiota whereas no significant differences 
were observed for other anti-diabetic medications, diuretics, or lipid lowering- and 
blood pressure lowering drugs’. 

In short, all study participants (Caucasian Danes) were recruited from the 
population-based Inter99 study”* or from the outpatient clinic at Steno Diabetes 
Center. Study volunteers were invited for two visits with approximately 14 days 
apart. At the first visit the participants were examined in the morning after an 
overnight fast of at least 10h and without prior morning physical activity. At the 
second visit a DEXA scan was performed to evaluate body fat percentage. All 
samples (from both non-diabetic and diabetic individuals) were generated within 
the Danish part of the MetaHIT consortium (http://www.metahit.eu), and the 
infrastructure of this project was constructed specifically to make all samples 
comparable by ensuring similar handling and by minimizing the risk of batch 
effects. All of these samples were collected individually as participant recruitment 
proceeded, by the same staff and using the same protocol for both stool and blood 
samples. Serum for metabolomics and stool for metagenomics were stored at 
—80°C until the planned analyses took place. 

The study was approved by the Ethical Committees of the Capital Region 
of Denmark (HC-2008-017 and H-15000306) and was in accordance with the 
principles of the Declaration of Helsinki. All individuals gave written informed 
consent before participating in the study. 

Phenotyping. Clinical phenotyping and fasting biochemistry were performed 
and analysed as reported'. HOMA-IR was calculated as: (fasting plasma glucose 
(mmol/1) x fasting serum insulin (mU/ 1))/22.5'4 and HOMA-B (a measure of 
pancreatic beta-cell function) calculated as (20 x fasting plasma insulin)/(fasting 
plasma glucose — 3.5)!*. Due to the confounding effects of anti-diabetic treat- 
ment, HOMA-IR and HOMA-B were only assessed in non-diabetic individuals. 
BMI was calculated as weight (kg) divided by height (m7) and obesity defined as 
BMI >30kg/m7. Metabolic syndrome was defined according to the International 
Diabetes Federation!>. Non-diabetic individuals classified as having the metabolic 
syndrome in accordance with the IDF (mn = 163) were characterized by central 
obesity defined by gender-specific increased waist circumference as well of two 
of four additional risk factors for cardiovascular disease: raised levels of fasting 
plasma glucose or triglycerides, or reduced fasting plasma level of HDL-cholesterol, 
or raised blood pressure. In order to account for differences in age, sex and BMI, 
we matched the 75 Danish type 2 diabetes patients with 75 age-, sex- and BMI- 
matched individuals chosen from the non-diabetic study samples of 277 Danish 
individuals (Supplementary Table 1). 

Generation of serum metabolome data set. The metabolomics analyses were 
performed on all human samples together, in randomized order, both related to 
the sample preparation and instrumental analysis. Similarly, all 48 mouse samples 
were analysed for BCAAs together in randomized order. Control serum samples 
consisting of pooled human serum were included in equal intervals as part of the 
sample run for both human and mouse samples (n = 30 for humans and n=4 for 
mice), as well as a set of pure standard samples (pure standards in a solvent for 
evaluation of instrument performance and robustness), extracted standard samples 
(standards going through the same sample preparation to monitor the performance 
on the whole analytical procedure), blank samples (only solvent to observe possible 
background peaks) and calibration samples. 

Metabolomics analyses were also conducted on previously reported Danish 
type 1 diabetes patients’ (n = 30 for polar metabolites, n = 31 for molecular lipids). 


These data were only used for the co-abundance clustering of metabolites 
(see below) to increase sample size for finding inter-variable conditions. Type 1 
diabetes patients are distinctive from type 2 diabetes patients as they represent 
both a very different pathology and lifelong medical/lifestyle treatment and were 
consequently not used for any other analyses. 

Human and mouse samples were analysed at VTT Technical Research Centre 
of Finland (Espoo, Finland) and Steno Diabetes Center (Gentofte, Denmark), 
respectively, using the same type of instrument with identical analytical conditions. 
Analysis of fasting serum polar metabolites by GC x GC-TOFMS. Serum polar 
metabolites were analysed in 396 human and 48 mouse serum samples (24 pre- 
and 24 post-gavage samples) using comprehensive two-dimensional gas chroma- 
tography combined with time-of-flight mass spectrometry (GC x GC-TOFMS, a 
LECO Pegasus 4D equipped with a cryogenic modulator from LECO Corp.) with 
a method described previously*”. Specifically, 400,11 methanol and 10,1 internal 
standard mixture (C17:0 (186.5 mg/l), deuterated valine (37 mg/l) and succinic 
acid-dy4 (63 mg/l)) were added to 30,11 of serum samples. The samples were vortex 
mixed and centrifuged for 5 min at 10,000 r.p.m. and half of the supernatant was 
evaporated to dryness. This was followed by two-step derivatization using methox- 
imation and silylation by first adding 25 11 methoxamine (45°C, 60 min) and then 
25 \l N-trimethylsilyl- N-methyl trifluoroacetamide (45°C, 60 min). Finally, 
a retention index standard mixture (n-alkanes, 25 11, concentration = 8 mg/l) 
and an injection standard (4,4/-dibromooctafluorobiphenyl, 50 wl, 
concentration = 10 mg/l), both in hexane, were added to the mixture. The 
calibration consisted of six points for each quantified metabolite. 

The columns were as follows: a methyl-deactivated retention gap (1.5 m x 0.53 mm 
id.) was connected to 10m x 0.18 mm Rxi-5MS (phase thickness 0.18 ,1m) and to 
1.5m x 0.1mm BPX-50 (phase thickness 0.1 }1m). Helium was used as the carrier 
gas at a constant pressure mode (40 psig). A 4-s separation time was used in the 
second dimension. The temperature program was as follows for the first-dimension 
column: 50°C (2 min), at 7°C/min to 240°C and at 25°C/min to 300°C (3 min). 
The second-dimension column temperature was 20°C higher than the correspond- 
ing first-dimension column throughout the program. 

The analytical method used allows for combined targeted and untargeted 
analysis, where a selected subset of metabolites can be quantified’”. In the present 
study, quantitation of 27 target metabolites (stearic acid, oleic acid, linoleic acid, pal- 
mitic acid, citric acid, glutamic acid, 3,4-dihydroxybutanoic acid, 3-hydroxybutyric 
acid, threonine, 2,4-dihydroxybutanoic acid, phenylalanine, tyrosine, 2-hydroxy- 
butyric acid, serine, glyceric acid, methionine, glycine, leucine, isoleucine, valine, 
ornithine, proline, cholesterol, arachidonic acid, alanine and aspartic acid) was 
done by external calibration curves for each individual metabolite. 

ChromaTOF vendor software (LECO) was used for within-sample data 
processing, and the Guineu software*” was used for alignment, normalization and 
peak matching across samples. The normalization was performed by correction 
for internal standards and specific target metabolites were additionally quantified 
using external calibration curves. Other mass spectra from the GC x GC-TOFMS 
analysis were searched against the NIST Mass Spectral Library and Golm 
Metabolome Database*’, also using retention index data in the identification. 
Artefact peaks due to chemical background and compounds outside the linear 
range of the method were removed from the data set. Control serum samples 
(n= 30 for human, n= 4 for mouse analyses) were analysed together with the 
samples. The relative standard deviation (RSD) for internal standards, spiked into 
the samples, was on average of 12.8% for the human serum sample analyses. The 
RSD% of the quantified metabolites in the control serum samples (n = 30) was on 
average 18.5% for the analysis of human serum samples. For quantitative analysis 
of BCAAs in mice serum samples, the RSD% was 13.3% for isoleucine, 11.2% for 
leucine and 24.2% for valine in control serum samples (n= 4), and for internal 
standards in mice serum the RSD was on average 16.6%. Neither sample prepara- 
tion nor analysis order showed any significant effect on the results. 

All serum metabolite peaks that were present (non-zero value) in more than 
50% of samples were included in the data analyses, including the unidentified 
ones. We reasoned that inclusion of complete data as obtained from the platform 
best represent the global metabolome as covered by the platform. The unidenti- 
fied peaks were annotated with their structural class from the Golm Metabolome 
Database** using functional group prediction based on the fragmentation 
patterns*”. In total 325 (94 known and 231 unknown) serum polar metabolites 
were measured in the human study. 

Analysis of serum molecular lipids using UHPLC-QTOFMS. Serum 
molecular lipids were analysed in 397 human serum samples using ultra-high- 
performance liquid chromatography coupled with time-of-flight mass spec- 
trometry (UHPLC-QTOFMS Q-Tof Premier mass spectrometer, Waters, Inc.) 
using electrospray ionization in positive ionization mode with a methodology 
described earlier*’. Specifically, a standard mixture 1 (20,1) containing LPC(17:0), 
PC(17:0/17:0), PE(17:0/17:0) and Cer(d18:1/17:0) (Avanti Polar Lipids, Inc.) 
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and TG(17:0/17:0/17:0) (Larodan Fine Chemicals AB), all at concentration of 
10 mg/l, was added to 101] of serum samples and the samples were extracted 
with chloroform and methanol (2:1; 100,11). The lower phase (60 11) was collected 
after centrifugation and 20 11 of internal standard mixture 2 (LPC(16:1D3), 
concentration: 10 mg/l, PC(16:1/16:1-D¢), concentration: 15 mg/l, and 
TG(16:0/16:0/16:0-!2C3), concentration: 15 mg/l, was added. The extracts were 
analysed on an Acquity UPLCTM BEH C18 2.1 x 100mm column packed with 
1.7\1m particles. The solvent system included 1% 1 M NH4Ac, 0.1% HCOOH and 
acetonitrile/isopropanol (1:1, 1% 1M NHy,Ac, 0.1% HCOOH) in gradient elution 
mode with a flow rate of 0.4ml/min. The serum lipid profiling was carried out at 
a mass range of m/z 300-1,200 with scan duration of 0.2. 

The data processing using MZmine 2 (ref. 50) included alignment of peaks, peak 
integration, isotopic grouping, normalization, and peak identification. Serum lipids 
were identified using an internal spectral library or with tandem MS and data were 
normalized using the internal standards representatives of each class of serum lipid 
present in the samples as previously described’?. Sphingomyelins were normalized 
with the PC standard. All lipid peaks were included in the data analyses, including 
the unidentified ones. Control serum samples (1 = 30), extracted standard samples 
(n= 22) as well as blanks and pure standard samples were analysed together with 
the study samples. The standard deviation (RSD) for internal standards, spiked 
into the samples, was 8-13% in serum samples (n = 397) and 9.1% in the extracted 
standards (n= 22). The RSD% of the identified lipids in the control serum samples 
was on average 12.8%. In total 876 molecular serum lipids, 289 known and 
587 unknown, were measured. Neither sample preparation nor analysis order 
showed any significant effect on the results. 

Clustering of co-abundant serum metabolites. Clusters of co-abundant serum 
metabolites were identified using the R package WGCNA® (Fig. 1a). Serum polar 
metabolites and serum molecular lipids (see above) were analysed separately but 
are collectively referred to as serum metabolites. Signed, weighted metabolite 
co-abundance correlation (biweight midcorrelations (a median-based correlation 
measure that is more robust to outliers than Pearson correlation®”) after log, 
transformation) networks were calculated across all examined individuals. 
A scale-free topology criterion was used to choose the soft threshold 3= 13 for 
the polar metabolites and = 14 for molecular lipid correlations. Clusters were 
identified with the dynamic hybrid tree-cutting algorithm™, using deepSplit 
of 4 and a minimum cluster size of 3 and 5 for polar metabolite and molecu- 
lar lipid clusters, respectively. Serum metabolites that did not fit the clustering 
criteria were combined in a group named ‘remaining’ The profile of each serum 
metabolite cluster was summarized by the cluster eigenvector (that is, the first 
principal component of the metabolite abundances across individuals). Similar 
clusters were subsequently merged if the biweight mid-correlation between 
the cluster’s eigenvectors exceeded 0.8 and 0.75 for serum polar metabolite and 
serum molecular lipid clusters, respectively. The serum polar metabolite and 
serum molecular lipid clusters (labelled MO0O-M35 and L00-L339, respectively) are 
collectively termed metabolite clusters. 

Construction of a non-redundant metagenomics reference gene catalogue. The 
reference gene catalogue used in the present study was constructed from Illumina 
shotgun sequencing data from 620 publically available faecal samples***. The data 
was processed using the MOCAT software package (version 1.1)°°. Reads were 
trimmed (option read_trim_filter) using a quality and length cut-off of 20 and 
30 bp, respectively. Trimmed reads were subsequently screened against a custom 
database of Illumina adapters (option screen_fastafile) and the human genome 
version 19 using a 90% identity cut-off (option screen). The resulting high-quality 
reads were assembled (option assembly) and assemblies revised (option assembly_ 
revision). Genes were predicted on scaftigs with a minimum length of 500 bp 
(option gene_prediction). 

Predicted protein-coding genes with a minimum length of 100 bp were clus- 
tered at 95% sequence identity using CD-HIT (version 4.6.1)°” with parameters 
set to: -c 0.95, -G 0 -aS 0.9, -g 1, -r 1). The representative genes of the result- 
ing clusters were ‘padded’ (that is, extended up to 100 bp at each end of the 
sequence using the sequence information available from the assembled scaftigs), 
resulting in the final reference gene catalogue used in this study. The reference 
gene catalogue was functionally annotated using SmashCommunity (version 
1.6)°8 after aligning the amino acid sequence of each gene to the KEGG database 
(version 62)?°. 

Profiling of metagenomics samples. We obtained metagenomics data for the 291 
non-diabetic individuals and 75 type 2 diabetes patients as previously described. 
The DNA extraction protocol was standardized for all samples and all sequencing 
was done using Illumina platforms, using paired-end reads (Supplementary 
Table 14). Of the 291 non-diabetic samples, 14 that were sequenced with the 
Illumina Solexa technology had average read length below 42 nt (leading to 
less certain mappings) and were thus excluded from further analyses to avoid 
unnecessary biases, resulting in a final set of 277 non-diabetic individuals with 
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gut microbiome data. All 75 type 2 diabetes samples passed all quality control 
criteria. Sequencing statistics for each sample have previously been described* 
and are repeated in Supplementary Table 14. Raw insert (sequenced fragments 
of DNA represented by single or paired-end reads) count profiles were generated 
using MOCAT® by mapping high-quality reads from each metagenome to the 
reference gene catalogue (option screen) using an alignment length and identity 
cut-off of 45% and 95%, respectively. The abundance profile for each catalogue 
gene was calculated as the sum of uniquely mapped sequence reads (97% identity) 
using 7M sequence reads per sample (downsized). KEGG Orthology (KO) gene 
group abundance profiles were calculated by summing the abundance of genes 
annotated to the respective KO gene group. KOs were mapped to microbiome 
functional KEGG modules based on annotations downloaded on 14 January 2014 
from the KEGG BRITE database. Gene richness was calculated as the number 
of genes with abundance higher than zero in the given sample (downsized). For 
negative binominal statistics all uniquely mapped sequence reads were used to 
estimate abundances. 

Metagenomics species construction. The gene catalogue was clustered by 
co-abundance as described by Nielsen et al.>°, which defined 10,754 co-abundance 
gene groups (CAGs) with high correlations (Pearson correlation coefficient 
>0.9). The 925 largest of these, with more than 700 genes, were considered as 
metagenomics species (MGS) and referred to as species throughout the article. 
The abundance profiles of the CAGs and MGS were determined as the medium 
gene abundance (7M reads per sample) throughout the samples. Furthermore, the 
CAGs and MGS&s were taxonomically annotated, by summing up the taxonomical 
annotation of their genes as described by Nielsen et al.°”. Each gene was annotated 
by sequence similarity to known reference genomes (blastN, E-value < 0.001 
against imomi4 genome databank, August 2012 release). 

Mouse intervention study with Prevotella copri. P. copri (CB7, DSMZ) was 
cultivated under anaerobic conditions in PYG medium (SSI Diagnostica), 
harvested in log phase, aliquoted and stored in 10% glycerol at —80°C until use. 
Mice (24 male WT C57BL/6J, Taconic, 10 weeks of age) were housed in cages of 
four and kept at 22°C in a specific-pathogen-free facility under a 12h light-dark 
cycle, fed ad libitum and had free access to water. Number of mice (sample size) was 
selected based on previous experience in measurement of dysglycaemic phenotype 
in a mouse model, but was not arrived at through formal analysis. After 2 weeks 
of acclimatization on a standard chow diet (Altromin 1310, Altromin), mice were 
transferred to a soy protein-based 60% high-fat diet low in BCAAs (12492D, 
Ssniff) in order to prime for later development of glucose intolerance and insulin 
resistance and sham-gavaged (1% glycerol in PBS) twice a week. After three weeks 
on high-fat diet, mice were divided into two equal groups of 12 mice each (three 
cages per group, minimizing the influence of potential cage effects). The groups 
were not randomized but stratified by magnetic resonance (MR)-based fat mass 
and weight. For the following three weeks, mice were gavaged twice weekly with 
100 11 of either P copri (5 x 10° CFU/mouse, 1% glycerol in PBS) or sham while 
remaining on the high-fat diet. Blood for BCAA measurements was drawn 3 days 
before diet change and 11 days post diet change in EDTA-coated tubes kept on 
ice and centrifuged at 4°C for 10 min at 1,000g before storage at —80°C until 
metabolomics analysis. 

An oral glucose tolerance test (OGTT) was performed after two weeks of P. copri 
challenge. The mice were fasted for 5h with free access to water and gavaged with 
3g glucose/kg lean mass. Blood glucose was measured in tail vein blood before 
and 15, 30, 45, 60, 90, and 120 min post the glucose bolus, using standard Contour 
Next Test Strips that met 2013 accuracy criteria requirements from International 
Organization for standardization of diagnostic tests (Bayer Contour). 

Insulin secretion capacity was evaluated by measuring plasma insulin before 
(TO) and 15 min post glucose bolus (T15) using an electrochemiluminescence assay 
(Meso Scale Diagnostics) following the manufacturer's instructions. 

An insulin tolerance test was performed after 3 weeks of bacterial challenge. 
The mice were acclimatized in clean cages without bedding for 2h and injected 
intra peritoneally with 0.75 U insulin/kg lean mass in alternating order. Insulin 
was diluted in succinulated gelatin (Gelofusine B. Braun Melsungen AG), allowing 
increased efficacy of insulin administration, thus minimizing intra experimental 
standard variations. Blood glucose was measured immediately before and 15, 
30, 45, 60, 90 and 120 min post insulin bolus, using standard Contour Next Test 
Strips that met accuracy criteria requirements from International Organization for 
standardization of diagnostic tests (Bayer Contour). 

In all mouse experiments performed, mice were handled in alternating order. 
During ITT and OGTT the operator was semi-blinded as each mouse was assigned 
a random number corresponding to the time (minute) for insulin/glucose bolus 
administration. 

For quantitative PCR and 16 rDNA amplicon sequencing, bacterial DNA 
from faecal samples were extracted using a NucleoSpin soil kit (Macherey- 
Nagel) according to manufacturer’s instructions. DNA yield and integrity 
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were assessed using a Nanodrop and agarose gel electrophoresis, respectively. 
Quantitative PCR analyses were performed using the SYBR Green qPCR 
Master mix (Thermo Scientific) and the Stratagene Mx3000P qPCR System. 
The relative abundance of P. copri was determined by normalization to the 
reference 16S rDNA gene using the comparative Cy method”? for relative 
gene expression. Primer sequences were previously published”? and are as 
follows: Universal 16S forward: 5’/-ACTCCTACGGGAGGCAGCAGT-3’, 
Universal 16S reverse: 5'/-ATTACCGCGGCTGCTGGC-3’, P. copri forward: 
5'-CCGGACTCCTGCCCCTGCAA-3’, P. copri reverse: 5'/-GTTGCGCCA 
GGCACTGCGAT-3’. 

The PCR-based library formation for the 16S rDNA amplicon sequencing was 
performed using 10ng DNA, 0.2|1M of each barcoded forward and reverse primer, 
0.2mM dNTPs and 0.5 units Phusion high fidelity DNA polymerase (Thermo 
Scientific) in a total volume of 2511. To target the 16S rRNA gene’s variable region 
4 (V4) a forward primer 515F (5’-AATGATACGGCGACCACCGAGATCTACAC 
NNNNNNNN TATGGTAATTGTGTGCCAGCMGCCGCGGTAA-3’; ‘N’ 
indicates the nucleotides of the barcode sequence) and a reverse primer 806R 
(5’-CAAGCAGAAGACGGCATACGAGAT NNNNNNNNNNNNAGTCAGT 
CAG CC GGACTACHVGGGTWTCTAAT-3’) were used, both with Illumina 
adaptor sequences in the 5’ end®*'. Cycling condition was as follows: 98°C for 
30s followed by 35 cycles of 98°C for 5s, 56°C for 20s and 70°C for 20s. PCR 
products were purified using Agencourt AMPure XP (Beckman Coulter) beads. 
Subsequently, samples were pooled in equal concentrations and sequenced using 
an Illumina MiSeq with V2 PE500 cartridge (500 cycles). 

For 16S rDNA data analysis, generated sequences were analysed using QUIIME 
v.1.9.1 with default settings. Chimaera checking was performed using UCHIME” 
and de novo OTU-picking was performed using UCLUST® with 97% sequence 
similarity. Representative sequences were assigned taxonomy against the 
Greengenes database v.13_8 using the RDP-classifier®. Subsequent analyses 
were performed in R v.3.2.3 using the metagenomeSeq®, PhyloSeq’”, vegan® and 
ggplot2® packages. Data was filtered for low-abundance OTUs by removal of OTUs 
present in fewer than 3 of the 48 samples and with a relative abundance across all 
samples 0.005%. Analyses in R were performed with an average of 51,912 + 15,173 
(s.d.) sequences per sample after filtering. For bacteria, differential abundance 
analysis we normalized read counts with metagenomeSeq® using cumulative-sum 
scaling. Statistical analysis was performed on data filtered based on effective sample 
sizes where taxa were not included if they had fewer than X effective number of 
positive samples, where X is the median of estimated effective samples per feature 
calculated using metagenomeSeq. 

All animal experiments were conducted in accordance with national Danish 

guidelines (Amendment #1306 of 23 November 2007) as approved by the Danish 
Animal Inspectorate, Ministry of Justice, permission #2014-15-2934-01027. Mice 
were kept under specific-pathogen-free conditions and experimental protocols 
were validated by in-house standard operation procedures. 
Statistical analysis. Apart from the mouse experiment, in which Graphpad Prism 
version 6, Treestar, was used, all statistical analyses were performed in the statistical 
computing language R”’. As the non-diabetic study sample comprised both non- 
obese and obese individuals, a number of the physiological variables were not 
normally distributed (Extended Data Fig. 1). Consequently, non-parametric tests 
were applied in the statistical analysis. For all analyses involving microbiome 
data, we only included microbial species and KOs present in at least three non- 
diabetic individuals to avoid artefactual results. All comparisons with a Benjamini- 
Hochberg false discovery rate (FDR) <0.1 were considered significantly different 
throughout the analysis. To focus the analysis on phenotype-relevant variables, 
we employed an initial filtering of significant phenotype association of the serum 
metabolite clusters, microbial species and microbiome-derived KEGG modules 
(Fig. 1b). Serum metabolite clusters and species associations with HOMA-IR 
were conducted with a Spearman rank correlation test. Partial Spearman rank 
correlation tests”! were used to adjust the correlations for BMI (HOMA-IRgoitaqj)> 
or gene richness (HOMA-IRGenerichness.adj) Where applicable, using the ppcor 
R package”*. Serum metabolite clusters and species abundance differences between 
individuals with and without metabolic syndrome were tested with a Mann- 
Whitney U-test. We focused the downstream analysis on the serum metabolite 
clusters associated with all three phenotypes (HOMA-IR, HOMA-IRgoqj and 
metabolic syndrome), but other metabolite clusters were significantly associated 
with one or more of the three phenotypes that were not retained for further 
study. Owing to the low number of microbiome functional modules and species 
associated with all three phenotypes, we included all functional modules and 
species associated with HOMA-IR, HOMA-IRgoiagj or metabolic syndrome in 
further analysis. 

All associations for microbial functional modules were identified using a Mann- 
Whitney U-test, where the ranks of KOs within a given KEGG functional module 
were compared with the ranks of all other KOs. For associations of microbial 


functional modules with HOMA-IR, gene richness and serum metabolite clusters 
the ranks were based on Spearman correlation coefficients, whereas partial 
Spearman correlation coefficients were used to rank KOs for association with 
HOMA-IRgoiaj and HOMA-IRGenerichness.adj» a8 it allows adjusting for BMI and 
gene richness, respectively. To quantify the shift in (partial) Spearman correlation 
for a given KEGG module compared to the background distribution, we calculated 
the background adjusted median Spearman correlation (SCCpg aaj) for a given 
KEGG module mas: 


SCCpg. adj = median(SCCxosexrGG module m) = median(SCCkos¢KEGG module in) 


where SCCxo is the (partial) Spearman correlation coefficient between the KO 
and HOMA-IR, HOMA-IRgotag, HOMA-IRGenerichness.adp gene richness or 
serum metabolite clusters. For associations of microbial functional modules with 
metabolic syndrome and type 2 diabetes, the ranks were based on Wald statistics 
for testing differentially abundant KOs with a negative binomial test, using the 
DESeq2 R package”*. In contrast to all other tests, analyses using the negative 
binomial test were based on the non-rarefied gene counts. 

To identify the main microbial species driving the association between 
functional KEGG modules and HOMA-IR described above, the calculation of the 
KO gene group abundance profiles was iterated excluding the genes from a different 
species, in each iteration (leave-one-out analysis). The effect of a given species on 
a specified association was defined as the change in median Spearman correlation 
coefficient between KOs and HOMA-IR when genes from the respective species 
were left out (DeltaSCC), that is, for species s and KEGG module m: 

) -_ median (SCC excluding species s ) 


KOs€KEGG module m KOs€KEGG module m 


DeltaSCC,,, = median(SCCX, species 


For visualizing the species’ effect on the association between HOMA-IR and a given 
KEGG module m, SCCo¢ adj,m (the background-adjusted median SCC between 
HOMA-IR and KOs in module m when including all species) was used as reference, 
that is: 


Plottedvalue;,, = SCCpg.a4j,m — DeltaSCC;, 
_ \ excluding species s z all species 
= median(SOC RS Se module “) = median (SCCRO/ektGc module a 


In addition, the magnitude of each species’ relative contribution to the correlation 
was summarized as the percentage change compared to the original median 
Spearman correlation coefficient (% SCC), that is: 


Delt 
gsc, = DeltaSCCm 199 


SCC bg.adj,m 


When testing whether regression slopes were significantly different we used the 
z score as suggested by Paternoster et al.”4 and converted it to a P value using a 
2-sided t distribution. 

For analysis of the data from the mouse intervention, the repeated blood 
glucose measurements from the OGTT were compared between the two groups 
of mice using repeated measurements two-way ANOVA, Tukey’s post hoc test 
(Sigmaplot). The area under the curve (AUC) (GraphPad) was compared using 
a Mann-Whitney U-test (GraphPad). The effect of the P copri gavage challenge 
on the combined BCAA measures (leucine, isoleucine and valine) was tested on 
log-transformed metabolite values using a likelihood ratio test (linear model 
using generalized least squares), which does not assume independence between 
the amino acid measurements from the same mouse. For this, the variances of the 
three amino acids were modelled independently. To test for differences in the gut 
microbiota composition between sham and P. copri-gavaged mice we performed 
adonis test on the 16S data using weighted UniFrac distance. To test for bacteria 
families differentially abundant between sham and P copri-gavaged mice we used 
the fitZig function in the metagenomeSeq R package. 
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Extended Data Figure 1 | Distributions of continuous physiological 
traits for the 291 non-diabetic individuals, 75 type 2 diabetes patients 
and 75 matched non-diabetic controls. An overview of the same traits is 
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shown in Supplementary Table 1. The 75 non-diabetic controls are a subset 
of the 291 non-diabetic individuals matched to the type 2 diabetes patients 
by age, sex and BMI and used for comparative analyses. 
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Extended Data Figure 2 | The number of metabolite clusters, species 
and microbiome functional modules significantly associated with 
HOMA-IR, HOMA-IRgmiagj, and metabolic syndrome. a—c, Venn 
diagrams resuming the number of serum metabolite clusters (a), species 
(b) and microbiome functional modules (c) that are associated with the 
three HOMA-IR, HOMA-IRgmiagj and metabolic syndrome phenotypes 
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Total number of significant species = 81 
HOMA-IR = 79; HOMA-IR,,,,.4, = 21; MetS = 6 
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Total number of significant functional modules = 54 
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at FDR < 0.1. d, The number of microbiome functional modules 
associated with HOMA-IR, gene richness and HOMA-IRGenerichness.adj- 
The metabolite cluster associations are based on all 291 non-diabetic 
individuals whereas the species and KEGG module associations were 
estimated on the 277 non-diabetic individuals with microbiome data. 
Mets, metabolic syndrome. 
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(blue) or negative (red) correlations and FDRs are denoted: +, FDR< 0.1; 
*, FDR < 0.01; **, FDR < 0.001. The names of the 19 metabolite clusters 
making up the IR- and IS-metabotypes are highlighted with blue 


(IR-metabotype) and red (IS-metabotype), respectively. 


serum metabolite clusters and physiological traits in 291 non-diabetic 


individuals. Spearman correlations between all fasting serum metabolite 
clusters (top panel, molecular lipids; bottom panel, polar metabolites) 


and clinical phenotypes. The metabolites in each panel are clustered by 
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Extended Data Figure 5 | Correlations between IR- and metabolic 
syndrome-associated microbial species and fasting serum metabolite 
clusters in 277 non-diabetic individuals. Spearman correlations between 
species that were significantly associated (FDR < 0.1) with HOMA-IR, 
HOMA-IRgmiaqj Or metabolic syndrome phenotypes and the 19 metabolite 
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clusters making up the IR- and IS-metabotypes. The colour represents 
positive (blue) or negative (red) correlations and FDRs are denoted: 

+, FDR <0.1; *, FDR< 0.01; **, FDR < 0.001. The left sidebar represents 
positive (blue) or negative (red) correlations between the species and the 
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Extended Data Figure 6 | Abundances of P. copri and B. vulgatus non-diabetic individuals arranged by decreasing P. copri abundance and 
anti-correlate and their combined abundance correlates with HOMA-IR __ increasing B. vulgatus abundance (a), and arranged by total abundance of 
in 277 non-diabetic individuals. a, b, The abundances of T2DCAG00385: both species with HOMA-IR levels shown above (b). 
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Extended Data Figure 7 | Correlations between microbial species and 
both HOMA-IR and the BCAA-containing metabolite cluster (M10) 
in 277 non-diabetic individuals. a, b, Spearman correlations between 
species and both the BCAA-containing metabolite cluster (a, M10) 

and insulin resistance (b, HOMA-IR) in individuals with detectable 
abundances of the respective species. FDRs of 0.1 and 0.05 are denoted 
with dotted and dashed lines, respectively. Colour intensity represents 
mean species abundance in individuals where the species was observed. 
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faecal samples from mice after three weeks of treatment with P.copri 
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Replication fork stability confers 
chemoresistance in BRCA- deficient cells 


Arnab Ray Chaudhuri!*, Elsa Callen!*, Xia Ding’, Ewa Gogola®, Alexandra A. Duarte, Ji-Eun Lee*, Nancy Wong}, 

Vanessa Lafarga°, Jennifer A. Calvo®, Nicholas J. Panzarino®, Sam John!, Amanda Day’, Anna Vidal Crespo!, Binghui Shen’, 
Linda M. Starnes’, Julian R. de Ruiter’, Jeremy A. Daniel®, Panagiotis A. Konstantinopoulos®, David Cortez!°, Sharon B. Cantor®, 
Oscar Fernandez-Capetillo®, Kai Ge*, Jos Jonkers*, Sven Rottenberg?"", Shyam K. Sharan? & André Nussenzweig! 


Cells deficient in the Brcal and Brca2 genes have reduced capacity to repair DNA double-strand breaks by homologous 
recombination and consequently are hypersensitive to DNA-damaging agents, including cisplatin and poly(ADP-ribose) 
polymerase (PARP) inhibitors. Here we show that loss of the MLL3/4 complex protein, PTIP, protects Brcal/2-deficient 
cells from DNA damage and rescues the lethality of Brca2-deficient embryonic stem cells. However, PTIP deficiency does 
not restore homologous recombination activity at double-strand breaks. Instead, its absence inhibits the recruitment of 
the MREII nuclease to stalled replication forks, which in turn protects nascent DNA strands from extensive degradation. 
More generally, acquisition of PARP inhibitors and cisplatin resistance is associated with replication fork protection in 
Brca2-deficient tumour cells that do not develop Brca2 reversion mutations. Disruption of multiple proteins, including 
PARP1 and CHD4, leads to the same end point of replication fork protection, highlighting the complexities by which 
tumour cells evade chemotherapeutic interventions and acquire drug resistance. 


The role of BRCA1 and BRCA2 in the repair of double-strand breaks 
(DSBs) is thought to be central to their tumour-suppressor activities, 
and underlies the hypersensitivity of Brca-deficient cells to DNA- 
damaging agents. While cisplatin and PARP inhibitors (PARPi) have 
been shown to be effective chemotherapeutic agents, most Brca-mutant 
carcinomas acquire resistance’. Besides reduced uptake and increased 
efflux of drugs, the most well-described mechanism that drives 
chemotherapeutic resistance in Brca1/2-deficient tumours is through 
the restoration of homologous recombination (HR)!. Identification 
of additional mechanisms underlying resistance to DNA damage is 
crucial for improving therapies and predicting tumour responses in 
Brca-deficient cancers. 


PTIP loss protects RFs in Brca-deficient cells 

In addition to their roles in HR, recent studies have uncovered DSB- 
independent functions for BRCA1 and BRCA2 during replication 
stress, Since MRE11 has been implicated in mediating replication 
fork (RF) degradation in cell lines**, we tested whether primary cells 
deficient in BRCA1 or BRCA2 also showed degradation of nascent 
replication tracts. We therefore conditionally inactivated BRCA1 and 
BRCA2 in B lymphocytes (Breal“CD19Cre; Brca2/Cd19Cre). B cells 
were sequentially labelled with CldU-(red) followed by IdU-(green), 
after which the active RFs were stalled with hydroxyurea (HU) (Fig. 1a). 
The relative shortening of the IdU tract after HU treatment serves as a 
measure of RF degradation (Fig. 1a). Upon HU treatment, wild-type 
(WT) cells showed a mean IdU/CldU tract ratio close to 1 (Fig. 1b). 
However, Brcal- and Brca2-deficient B cells exhibited a 30-45% reduc- 
tion in the IdU tract length (Fig. lb-e and Extended Data Fig. la-c). 


Consistent with previous data”?, RF degradation in B lympho- 
cytes was dependent on MRE11 exonuclease activity (Extended Data 
Fig. la—c). We also tested the role of DNA2 and the Werner syndrome 
helicase/nuclease (WRN) in degradation of forks in Brca2-deficient B 
cells. Treatment of Brca2-deficient cells with WRN inhibitor did not 
result in fork protection, whereas MRE11 and DNA2 were epistatic 
(Extended Data Fig. 1c). 

Since 53BP1, RIF1, and PTIP counteract BRCA1-dependent HR by 
inhibiting MRE11-dependent DSB resection’ !°, we examined whether 
these factors might also function in RF stability. We thus inactivated 
BRCA1, BRCA2, RIF1, PTIP and 53BP1 in B lymphocytes (Extended 
Data Fig. 2a). Short exposure to HU did not promote significant DSB 
formation (Extended Data Fig. 2b), and fork progression rates were 
comparable across all genotypes (Extended Data Fig. 2c-f). 

Absence of 53BP1 did not protect Brca1-deficient B cells from deg- 
radation of RFs (Fig. 1b), consistent with the finding that BRCA1 
acts in RF stabilization in a manner independent of DSB repair”. 
Nascent strands also shortened considerably in the absence of the 
53BP1 effector Rifl and in Brcal/Rif1 doubly deficient cells (Fig. 1c). 
In striking contrast, loss of Ptip protected RFs from HU-induced 
degradation in both Brcal- and Brca2-deficient cells (Fig. 1d, e). 
Moreover, while Brcal~’~, Rifl ~~ and Brcal Rif ~~ B cells 
displayed increased genomic instability when treated with HU 
(Extended Data Fig. 3a), Brcal/Ptip doubly deficient cells exhibited 
2.4-fold fewer chromosomal aberrations and increased viability com- 
pared with Brcal~’~ (Fig. 1f). Similarly, loss of Ptip decreased the 
number of chromosomal aberrations in Brca2~’~ cells challenged with 
HU (Extended Data Fig. 3b), suggesting that PTIP has functions at 
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Figure 1 | Loss of Ptip in Brca1/2-deficient B cells protects nascent 
DNA from degradation without restoring HR. a, Schematic for labelling 
B cells with CldU and IdU. b-e, Ratio of IdU versus CldU upon HU 
treatment. Numbers in red indicate the mean + s.d. (NS, not significant, 
P< 0.0001, Mann-Whitney test). One hundred and twenty-five 
replication forks were analysed for each genotype. f, Genomic instability 


stalled RFs distinct from its DSB-dependent interactions with 53BP1 
and RIF1. 

We hypothesized that HU-induced degradation would impact RF 
progression rates. We therefore assayed the ability of WT and mutant 
cells to incorporate nucleotide analogues in the presence of low con- 
centrations of HU. We observed a significant decrease in IdU tract 
lengths during HU exposure across all genotypes. However, Brea2- 
deficient cells had significantly decreased progression rates upon HU 
treatment, whereas Ptip-’~ and Brca2~‘~Ptip ‘~ cells displayed signif- 
icantly longer replication tracts (Extended Data Fig. 3c). We also tested 
the effect of Ptip-deficiency on recovery after replication stalling with 
high concentrations of HU. We found that although the percentage of 
restarted RFs did not change among genotypes (Extended Data Fig. 3d), 
loss of Brca2 resulted in a delayed restart, whereas Brca2/Ptip doubly 
deficient cells restarted normally (Extended Data Fig. 3e). Thus, loss 
of PTIP promotes RF progression and timely restart in Brca2-deficient 
cells, which correlates with decreased RF degradation. 


PTIP loss rescues lethality of Brca2-null ESCs 

Since elevated levels or stabilized RAD51 filaments could protect RFs 
from degradation”?"*, we asked whether PTIP deficiency leads to over- 
expression of RAD51 or enhanced RADS51 activity. RAD51 levels were 
similar in WT, Ptip~/ ~, Brca2~/—, and Brca2~/’ “Ptip ~~ cells (Extended 
Data Fig. 3f), but the ability of RAD51 to relocalize to sites of DNA 
DSBs was severely impaired in Brca2/Ptip-deficient B cells (Fig. 1g and 
Extended Data Fig. 3g) and embryonic stem cells (ESCs) (see Fig. 2c, d). 
Moreover, loss of Ptip did not enhance the loading of RAD51 on 
nascent chromatin (see Fig. 3f). 

Loss of Brca2 in ESCs is incompatible with cell survival'’. To test 
whether PTIP deficiency could promote ESC survival we knocked 
down PTIP in PL2F7 mouse ESCs, which have one null and one 
conditional allele of Brca2 (Brea2!-, Fig. 2a and Extended Data 
Fig. 4a)!”. After CRE transfection in Brca2/~ ESCs and selection in 
HAT medium, very few resistant colonies were obtained and these 


(top) and viability upon HU treatment (bottom) relative to WT upon 

6h of 10mM HU treatment. (**P < 0.001, *P < 0.05, unpaired t-test.) 
Fifty metaphases were analysed. g, Representative images (top) and 
quantification (below) of irradiation (IR)-induced RAD51 foci. *P < 0.05, 
unpaired t-test (n = 120 cells examined). Experiments were repeated three 
times. 


remained Brca2’~ rather than Brca2-null, reflecting the essential role 
of BRCA2 in ESC viability (Fig. 2b)’. Strikingly, 12.5% and 5% of the 
HAT-resistant colonies were Brca2-null when targeted by Ptip short 
hairpin RNAs (shRNAs) | and 2 respectively (Fig. 2b and Extended 
Data Fig. 4b). Consistent with our analysis of Brca2~/~Ptip ‘~ B cells 
(Fig. 1g), irradiation-induced RAD51 foci formation was defective in 
Brca2/Ptip-deficient ESCs (Fig. 2c, d). Moreover, while efficient gene 
targeting to the Pim-1 locus was observed in WT ESCs using a pro- 
moterless hygromycin cassette (100% of the hygromycin-resistant 
WT clones were targeted integrations), we did not observe a single 
targeted clone in Brca2/Ptip-deficient ESCs (Fig. 2e) indicative of 
defective HR. Similarly, the synthetic HR reporter substrate DR-GFP 
revealed impaired HR in Brca2/Ptip-deficient ESCs (Extended Data 
Fig. 4c). Nevertheless, stalled RFs in PTIP-deficient Brca2 ~~ ESCs 
displayed RF protection compared with Brca2 hypomorphic mutant 
ESCs (Y3308X)"” (Fig. 2f). Thus, deficiency in PTIP protects RFs from 
degradation and rescues the lethality of Brca2 knockout ESCs without 
restoring DSB-induced HR. 


BRCA2 is dispensable for HR at RFs 

It has been suggested that HR at stalled forks is regulated differently 
from HR at DSBs!*®. As a readout for HR at RFs, we assayed for sister 
chromatid exchanges (SCEs) in WT and Y3308X ESCs. Although 
Y3308X cells show undetectable levels of irradiation-induced RAD51 
formation and loss of targeted integration, indicative of a defect in 
DSB-induced HR", the basal frequency of SCE was normal in Y3308X 
cells (Extended Data Fig. 4d). Moreover, RAD51 was enriched on nas- 
cent DNA in Y3308X during normal replication and in presence of 
HU as measured by isolation of proteins on native DNA (iPOND) 
analysis (Extended Data Fig. 4e). We also observed similar frequen- 
cies of spontaneously generated and DNA damage-induced SCEs 
in WT, Brca2-null and Brca2/Ptip-deficient B cells (Extended Data 
Fig. 4f). Thus, in contrast to RAD51 which is required for DSB- and 
replication-associated HR', BRCA2 appears to be dispensable for HR 
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Figure 2 | PTIP deficiency rescues the lethality of Brca2-null mouse 
ESCs and confers fork protection. a, Schematic for deletion of Brca2. 
b, PCR genotyping of ESC clones (M, marker; P, positive control for 
Brca2/-;N, no DNA control). c, d, Representative images (c) and 
quantification (d) of irradiation-induced foci in shPtip and Brea2~/~/ 
shPtip ESCs (n= 110 cells examined). e, Representative Southern blot 
images (top) and quantification for targeting efficiency (bottom) for 
59xDR-GFP* gene targeting to the Pim-1 locus. f, Ratio of IdU versus 
CldU (*#***P < 0.0001, Mann-Whitney test). One hundred and 
twenty-five replication forks were analysed. 


that uses the nascent sister chromatid to repair DNA lesions during 
replication. 


MREII association at RFs depends on PTIP-MLL3/4 
Although the recruitment of PTIP to DSBs after irradiation is depend- 
ent on 53BP1 (Extended Data Fig. 5a, b)!°, we hypothesized that PTIP 
might be recruited to sites of stalled RFs independently of its inter- 
actions with 53BP1. Consistently, we observed PTIP accumulation at 
sites of replication stalling marked by pan-nuclear +-H2AX staining””. 
Among cells exhibiting pan )-H2AX signal, 71.4% of WT and 78% of 
53Bp1~/~ mouse embryonic fibroblasts (MEFs) exhibited PTIP foci 
following HU treatment (Fig. 3a and Extended Data Fig. 5c). Even in 
the absence of HU treatment, PTIP exhibited extensive co-localization 
with proliferating cell nuclear antigen (PCNA) during late S phase 
both in WT and in 53Bp1-deficient cells (Fig. 3b and Extended Data 
Fig. 5d, e). Collectively, these data suggest that PTIP might func- 
tion during normal or perturbed replication in a DSB- and 53BP1- 
independent manner. 

Like PTIP, MRE11 also associates with chromatin in a DNA damage- 
independent but DNA replication-dependent manner”! (Fig. 3c). Loss 
of PTIP resulted in a marked decrease of MRE11 association with 
PCNA foci in unperturbed cycling MEFs (Fig. 3c) and defective MRE11 
recruitment to ssDNA regions upon HU treatment (Extended Data 
Fig. 5f). Re-introduction of WT full-length PTIP into Ptip-’~” MEFs 
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Figure 3 | PTIP localizes to sites of replication and recruits MRE11 

to active and stalled replication forks. a, WT and 53Bp1 ~'~ MEFs 
expressing GFP-tagged PTIP were either treated or not (NT) with 

4mM HU and assessed for \-H2AX (red) and PTIP (green). DAPI 
(4',6-diamidino-2-phenylindole) indicated in blue. Quantification in 
Extended Data Fig. 5c. b, Co-localization of PCNA (magenta) and PTIP 
(green). Quantification in Extended Data Fig. 5d. c, Co-localization of 
PCNA (green) and MRE11 (red). Quantification in lower panel (n = 150 
cells examined). d, Ptip~/~ MEFs infected with either empty vector 

(EV, containing IRES-GFP) or full-length PTIP (GFP-PTIP) and probed 
for GFP (green), MRE11 (red) and PCNA (magenta). Quantification in 
lower panel (n= 150 cells examined). e, MRE11 (red) and )-H2AX (green) 
irradiation-induced foci. Quantification in Extended Data Fig. 5g. 

f, iPOND analyses of proteins at replication forks (capture). Input 
represents 0.25% of the total cellular protein content. RAD51 and MRE11 
levels (shown below) were normalized to total H3. Experiments were 
repeated three times. 


restored MRE11 co-localization with PCNA in late S phase (Fig. 3d). 
Thus, in contrast to irradiation-induced MRE11 foci, localization of 
MREII to sites of DNA replication is PTIP-dependent (Fig. 3c—-e and 
Extended Data Fig. 5g). 

To monitor MRE11 and RADS1 association with active and stalled 
RFs we performed iPOND analysis in WT and Ptip ’~ MEFs (Extended 
Data Fig. 5h)”. WT cells showed an increase in MRE11 and RAD51 
association with stalled RFs (Fig. 3f). Ptip-deficient cells also showed 
an increase in RAD51 association at RFs, but MRE11 association with 
nascent DNA was reduced upon HU treatment (Fig. 3f), consistent 
with our immunofluorescence analysis (Fig. 3c and Extended Data 
Fig. 5f). Thus, MRE11 deposition on newly synthesized DNA is 
dependent on PTIP, which itself is recruited to stalled forks upon HU 
treatment (Extended Data Fig. 5i)”. 
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PTIP is also known to constitutively associate with PA1 and with 
MLL3/MLL4 histone methyltransferases which catalyse methylation 
of histone H3 at lysine 4 (refs 23, 24). To identify the region of PTIP 
that promotes RF degradation in Brca2-deficient cells, we expressed EV 
(empty vector), FL (full-length PTIP), W165R (disrupting interactions 
with PA1)?>°, W663R (disrupting interactions with 53BP1 at DSBs)” 
or Del-BRCT5-6 (disrupting interaction with MLL3/4 independently of 
DSBs)**"46 in Brca2/Ptip doubly deficient cells. We observed that only 
reconstitution of Brca2/Ptip-deficient cells with PTIP-Del-BRCT5-6 
maintained fork protection (Extended Data Fig. 6a). 

We therefore tested whether the recruitment of MRE11 at stalled 
forks was dependent on MLL3/4. We observed that MRE11 associa- 
tion at RFs was dependent on MLL3/4 as monitored by iPOND and 
immunofluorescence analysis (Extended Data Fig. 6b, c). We also 
observed an enrichment of H3K4mel1 and H3K4me3 at nascent forks 
upon HU treatment that was PTIP- and MLL3/4-dependent (Fig. 3f 
and Extended Data Fig. 6b). Thus, deposition of MRE11 on newly 
synthesized or stalled chromatin correlates with the establishment of 
H3K4mel and H3K4me3 at RFs. 

To determine whether MLL4 contributes to degradation of stalled 
forks in Brca-deficient cells, we examined RF degradation in Brcal ae 
MIl4~~ and Brca2~/~MIll4~’~ B cells. Brcal~/~Mll4~/~ and Brea2~/~ 
MIl4~/~ cells displayed a partial rescue of fork degradation (Extended 
Data Fig. 6d, e). To test whether MLL4 methyltransferase activity is crit- 
ical, we targeted the catalytic SET domain of MLL4 in Brca1-deficient B 
cells. We observed a significant rescue of fork degradation in Brcal~/~ 
MIl4-SET ‘~ cells, suggesting that the methyltransferase activity is 
important for promoting fork degradation (Extended Data Fig. 6f). 


RF protection confers chemoresistance 

RF protection contributes to genome stability in a manner independ- 
ent of DSB-induced HR?*». Consistently, we observed that Brca2~/~ 
MIl4~’~ B cells showed a partial rescue of chromosomal aberrations 
upon PARPi and cisplatin treatment compared with Brca2~‘~ cells 
alone (Extended Data Fig. 6g). However, Rif1-deficient cells, charac- 
terized by extensive RF degradation (Fig. 1c) but normal irradiation- 
induced RAD51 foci formation (Extended Data Fig. 7a), showed 
increased chromosomal aberrations in response to replication poi- 
sons PARPi, HU and cisplatin (Extended Data Figs 3a and 7b, c)?”. 
We therefore speculated that RF stability mediated by loss of PTIP 
(Fig. 1d, e) might confer genome stability to Brca-deficient cells 
exposed to chemotherapeutics that poison DNA replication. Indeed, 
we found that Ptip deficiency reduced the levels of chromosomal aber- 
rations both in Brcal ~~ and in Brca2~‘~ B cells (Fig. 4a, b). 

To test whether differential levels of PTIP expression could be an 
indicator of patient responses to platinum chemotherapy, we que- 
ried clinical information from The Cancer Genome Atlas (TCGA) of 
patients with Brca1/Brca2-mutated ovarian serous adenocarcinoma 
treated with platinum chemotherapy (Extended Data Fig. 8a, b). 
Survival analysis demonstrated that platinum-treated Brca2 mutants 
with high PTIP expression were correlated with a longer progression- 
free survival (PFS) (Extended Data Fig. 8a). Lower expression of 
PTIP also predicted a shorter PFS in Brca2-associated ovarian cancers 
(Extended Data Fig. 8b). Taken together, these data suggest that PTIP 
levels could be a biomarker for acquired resistance to platinum-based 
chemotherapy in Brca1/2-mutated ovarian cancers. 

An unbiased shRNA screen recently demonstrated that reduced 
levels of the nucleosome remodelling factor CHD4 in Brca2 mutant 
cancers correlated with poor patient response to chemotherapy, and 
increased tolerance to DNA-damaging agents without restoration 
of RAD51-dependent HR**. To test whether the resistance mecha- 
nism in this case occurs through RF protection, we knocked down 
CHD4 in the Brca2 mutant ovarian cancer cell line PEO1 (Extended 
Data Fig. 8c, d). While CHD4 depletion in Brca2 mutant cells did not 
restore HR’®, we observed that it largely conferred protection to nas- 
cent replication tracts from degradation upon HU treatment (Fig. 4c). 
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Figure 4 | Replication fork protection confers genome stability and 
chemotherapeutic resistance. a, b, Genomic instability measured 

in metaphase spreads from B cells (n= 50; *P < 0.05, **P< 0.01, 

**%* P< 0.001, ****P < 0.0001, unpaired t-test). Experiments were 
repeated four times. c, Ratio of IdU versus CldU in BRCA2-mutated 
PEOI1 cells either mock (shNSC) infected or infected with shRNA against 
CHD4 (shCHD4) (*P < 0.05, ****P < 0.0001, Mann-Whitney test). 

One hundred and twenty-five replication forks were analysed. d, Ratio of 
IdU versus CldU in HU-treated B cells (*P < 0.05, ****P< 0.0001, 
Mann-Whitney test). One hundred and twenty-five replication forks 
were analysed. e, Genomic instability in B cells (n = 50; ***P < 0.001, 
unpaired t-test). Experiments repeated four times. f, g, Kaplan-Meier 
survival curves in mice implanted with either PARPi-naive or -resistant 
tumours and treated with either topotecan (f) or cisplatin (g) using 
log-rank (Mantel-Cox) test. h, Ratio of IdU versus CldU in untreated 

or HU-treated tumours (PARPi naive versus PARPi resistant) 

(#*** P< 0.0001, Mann-Whitney test). One hundred and twenty-five 
replication forks were analysed. 


Moreover, depletion of CHD4 resulted in significantly decreased 
recruitment of MRE11 upon HU treatment in Brca2 mutant cells 
(Extended Data Fig. 8e, f). 

These results suggest that MRE11-mediated degradation contributes 
to genome instability upon treatment with replication poisons. To test 
this idea, we pre-incubated Brca2-deficient B cells with the MRE11 
nuclease inhibitor mirin before treating with PARP inhibitor or cis- 
platin. Mirin treatment did not alter the frequency of replicating cells 
monitored by EdU incorporation (Extended Data Fig. 8g). However, 
incubation with mirin reduced the levels of PARPi-induced chromo- 
somal aberrations approximately twofold. Similarly, Brca2-deficient 
cells were partly protected from cisplatin-induced DNA damage 
when MREI1 nuclease activity was inhibited (Extended Data Fig. 8h). 
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We conclude that MRE11 nuclease promotes genomic instability in 
Brca2-deficient cells. 

Since PARP1 (ARTD1) is required for MRE11 localization to stalled 
replication forks” and its loss rescues the embryonic lethality in Brea2- 
null ESCs*°, we tested the contribution of PARP1 to RF stability and 
genome integrity by generating Brcal~/~Parp1~’~ B cells (Brcal/, 
Parp1~'~; Cd19Cre). Interestingly, loss of Parp1 protected Brcal- 
deficient RFs from degradation and resulted in a significant reduction 
in chromosomal aberrations (Fig. 4d, e). Nevertheless, Parp1 defi- 
ciency failed to rescue irradiation-induced RAD51 foci formation in 
Brca1-deficient cells (Extended Data Fig. 8i). Thus, despite the fact 
that treatment with PARP inhibitor increases levels of DNA damage 
in Brca1-deficient cells*!*”, loss of Parp1 before Brca1 loss protects 
against genome instability. 

To determine whether Brca2-deficient tumour cells acquire chemo- 
therapy resistance via RF protection, we induced PARPi-resistance 
using the KB2P mouse model for Brca2-deficient breast cancer 
(Extended Data Fig. 9a)**. A mammary tumour from KB2P mice was 
transplanted into syngeneic FVB mice and, when the tumour reached 
a size >200 mm’, mice were treated with PARPi (AZD2461) for 
28 consecutive days (Extended Data Fig. 9b, c). The tumour was initially 
responsive to treatment but eventually grew, upon which treatment was 
repeated until PARPi resistance was achieved (Extended Data Fig. 9c). 
The stability of acquired resistance was confirmed by re-transplanting 
matched naive and resistant tumours and treating the animals with 
vehicle or AZD2461 (Extended Data Fig. 9d). PARPi-resistant tumours 
also showed cross-resistance to replication poisons topotecan and 
cisplatin (Fig. 4f, g). 

Both naive and resistant Brca2-deficient tumours showed impaired 
irradiation-induced RAD51 foci formation (Extended Data Fig. 9e, f). 
We therefore assayed naive and resistant tumours ex vivo for RF sta- 
bility. The mean length of the CldU and the IdU tracts were similar in 
all samples that were not treated with HU (Fig. 4h and Extended Data 
Fig. 9g). While naive tumour cells showed degradation of nascent tracts 
upon HU treatment, resistant tumour cells were protected (Fig. 4h). 
These data suggest that RF protection rather than restoration of HR 
may be the main mechanism for acquired resistance in this mammary 
tumour. However, in this case, acquired resistance was not simply due 
to loss of PTIP or MRE11 proteins (Extended Data Fig. 9h). 


Discussion 

Our study and the accompanying paper*” provide the first examples 
of genetic alterations that bypass the essential requirement for DSB- 
induced HR, evidenced by the finding that reduction in PTIP or PARP1 
(ARTD1) levels rescues the lethality of Brca2-null ESCs. We specu- 
late that the reason Brca2 nullizygosity is compatible with viability is 
because RADS1 is able to perform essential HR functions (such as SCE) 
by loading onto RFs independently of BRCA2 (Extended Data Fig. 4e), 
even if RAD51 loading onto processed DSBs is strictly dependent on 
BRCA2 (Fig. 1g and Fig. 2c, d). We also show that loss of PTIP, PARP1 
and CHD4 confers resistance to a variety of DNA-damaging agents in 
Brca-deficient cells. Resistance to PARPi and cisplatin both in primary 
and in tumour cells grown in vitro or in vivo correlates with protection 
from RF degradation. While 53BP1 disruption rescues the viability of 
Brcal-deficient mice by restoration of HR®, cells from these animals 
show residual levels of chromosomal aberration. The significant level 
of RF degradation observed in Brcal~’~53Bp1~’~ cells (Fig. 1b) may 
contribute to this genome instability. 

Since RF degradation is mediated by MRE11, we propose that persis- 
tence of MRE11 at stalled forks is toxic and normally counteracted by 
BRCA1/2. One possibility is that MRE11 could initially be recruited to 
stalled forks to mediate RF restart*, and that BRCA1/2 is necessary to 
subsequently disengage MRE11 from already-processed DNA termini. 
Consistently, we have found a significant increase in chromatin-bound 
MRE11 in Brca2-deficient cells treated with HU or cisplatin (Extended 
Data Fig. 10 a, b)*. In Saccharomyces cerevisiae, failure to remove 
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MRE11 from single-stranded DNA can lead to hypersensitivity to 
a variety of clastogens***°. We therefore propose that deficiencies 
in PTIP, CHD4 and PARP1 could confer drug resistance in Brca- 
deficient cells by limiting the access of MRE11 to single-strand DNA at 
stalled RFs. 

In summary, we have shown that protection of nascent DNA from 
degradation provides a mechanism that can promote synthetic viabil- 
ity and drug resistance in Brca-deficient cells without restoring HR 
at DSBs. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 
Mice, MEBs, and B cell cultures. The 53Bp1~/~ (ref. 37), Brea (AIVfAI) 
(NCI mouse repository), Brca2ff (NCI mouse repository), Rif (ref. 27), Ptip!! 
(ref, 24) and MIl4” (ref. 38) mice have been described. To generate MIl4-SET-flox/ 
flox mice, exons 50-51 of the Mll4/Kmt2d gene were flanked by loxP sites. Cre- 
mediated deletion of the floxed exons 50-51 causes frame shift and generates a stop 
codon in exon 52. The resulting protein lacks the carboxy (C)-terminal 276 amino 
acids including the entire enzymatic SET domain. Resting splenic B cells were 
isolated from 8- to 12-week-old WT or mutant spleen with anti-CD43 microbeads 
(anti-Ly48; Miltenyi Biotech) and were cultured with LPS (25 1g/ml; Sigma), IL-4 
(5ng/ml; Sigma) and RP105 (0.5 j1g/ml; BD) as described”. Stimulated B cells were 
additionally infected with CRE to ensure a high level of gene deletion in these cells. 
All mice were randomly distributed in experimental groups after genotyping. WT 
MEFs were immortalized by SV40 retroviral infection, and SV40 immortalized 
Ptip!! (ref. 39) and MII3 ~/~MIl4f MEFs** were infected with a CRE retrovirus 
to delete Ptip and MIl4 respectively. Coding sequences for mouse Ptip-GFP were 
cloned into the MIGRI retroviral vector as previously described”. All cell lines 
used were tested for mycoplasma using a Mycoplasma Detection Kit (Invitrogen) 
and relevant cell lines were authenticated either by genotyping or western blot 
analysis. PARP inhibitor (KU58948) was obtained from AstraZeneca. Mirin was 
obtained from Sigma, PFM39 and WRN inhibitors were gifts from J. Tainer and 
R. Brosh respectively. DNA2 inhibitor has been described“. Cisplatin and HU 
were obtained from Sigma. For FISH analysis, metaphases were prepared and 
imaged as described’. Animal experiments were approved by the Animal Care 
and Use Committee of NCI-Bethesda and the Animal Ethical Committee of The 
Netherlands Cancer Institute. 
Generation of CHD4-deficient PEO1 lines using RNAi. BRCA2-mutated ovarian 
cancer cell lines PEO1 were grown in DMEM with 10% FBS and 1% Glutmax. 
RNAi for CHD4 was performed as previously described”*. 
ESC lines, shRNA knockdowns and rescue of BRCA2 viability. PL2F2 mouse 
ESCs were maintained in M15 media (Knockout DMEM with 15% fetal bovine 
serum, 0.00072% B-mercaptoethanol, 100 U/ml penicillin, 100 j1g/ml strep- 
tomycin and 0.292 mg/ml L-glutamine) at 37°C, 5% CO. ESCs were plated 
on a monolayer of mitotically inactive feeder cells as described previously’. 
BRCA2 conditional knockout ESCs were generated as previously described!’. 
Two different shRNAs against Ptip mRNA were cloned into pSuperior vector 
(Oligoengine). shRNA sequences are listed below with the targeted sequences 
underlined: mPtip shRNA-1 sense, 5'GATCCCC GIGGCGCTCTCCTGCCAGT 
TTCAAGAGA ACTGGCAGGAGAGCGCCAC TTTTTA 3/; mPtip shRNA-1 
antisense, 5’ AGCTTAAAAA GITGGCGCTCTCCTGCCAGT TCTCTTGAA 
ACTGGCAGGAGAGCGCCAC GGG 3’; mPtip shRNA-2 sense, 5'GATCCCC 
CCGTAGCAACACAGTCCTC TTCAAGAGA GAGGACTGTGTTGCTACGG 
TTTTTA 3’; mPtip shRNA-2 antisense, 5’ AGCTTAAAAA CCGTAGCAACAC 
AGTCCTC TCTCTTGAA GAGGACTGTGTTGCTACGG GGG 3’. 

shRNA vectors were linearized by Scal and electroporated into ESCs using a 
Bio-Rad gene pulser. Cells were selected in G418 (0.18 mg/ml) 24h after elec- 
troporation for 5 days. Individual colonies were picked and PTIP levels were 
determined by western blotting. The PGK-Cre plasmid was electroporated 
into Ptip shRNA knockdown cells (201g DNA for 1 x 107 cells). Thirty-six 
hours after electroporation, HAT (hypoxanthine-aminopterin-thymidine) 
selection was performed for 5 days, after which cells were switched to HT 
(hypoxanthine-thymidine) media for 2 days and then transferred to nor- 
mal media. Colonies were picked when visible, genomic DNA was extracted 
and Southern blotting was performed as described previously!”. For PCR gen- 
otyping, the following primers were used to amplify the Brca2 cko allele and 
Brca2-KO allele: Brca2-KO forward, 5’GCCACCTCTGCCTGATTCTA; Brca2-KO 
reverse, 5’ AAAGAACCAGCTGGGGCTCGAG; Brca2-flox forward, 5’ TG 
AAGTGGACCCTGTAACCC; Brca2-flox reverse, 5’ AGTTCTCTCCTTTCA 
GCCTTCT. 
Gene targeting and the gene conversion HR assay. For gene targeting, ESCs were 
electroporated with 751g of linear targeting fragment from p59xDR-GFP, followed 
by selection for 5 days in 110j1g/ml hygromycin; then they were allowed to grow in 
normal ESC media*®. Colonies were picked, and targeted clones were identified by 
Southern hybridization as described*°. For the DSB-induced HR assay, ESCs were 
transiently electroporated with 201g of either pDR-GFP only or pDR-GFP plus 
I-Scel expressing vectors for 48 h. Gene conversion was read-out as the percentage 
of GFP-positive cells by flow cytometry analysis“). 
DNA fibre analysis. Asynchronous B cells, BRCA2-mutated PEO1 cells and 
Brca2-deficient K2BP tumours were labelled with 30 1M CldU, washed with 


PBS and exposed to 250\1M IdU. After exposure to IdU, the cells were washed 
again in warm PBS and treated or not with HU before collection. Cells were 
then lysed and DNA fibres stretched onto glass slides, as described*’. The 
fibres were denatured with 2.5 M HCl for 1h, washed with PBS and blocked 
with 2% BSA in phosphate buffered saline Tween-20 for 30 min. The newly 
replicated CldU and IdU tracts were revealed with anti-BrdU antibodies rec- 
ognizing CldU and IdU respectively. Images were taken at 60 x magnification 
(Zeiss Axio Observer.Z1), and statistical analysis was performed using GraphPad 
Prism. 

DSB detection by PFGE. DSB detection by PFGE was done as described’. 
Ethidium-bromide-stained gels were analysed using an UVP EC3 imaging 
system. Quantification was performed using ImageJ normalizing DSB signals 
to unsaturated DNA signals in the well. Relative DSB levels were obtained by 
comparing treatment results to the background DSB signals observed for 
untreated conditions. 

Western blotting and immunofluorescence. Primary antibodies were used 
at the following dilutions: anti-tubulin (1:15,000, Sigma), anti-H3K4me3 
(1:5,000, Millipore), anti-H3K4mel (1:5,000, Millipore), anti-RAD51 (1:50, 
Santa Cruz), anti-PCNA (1:2,000, Santa Cruz), anti-total H3 (1:1,000 Millipore) 
and anti-Mre11 (1:10,000, gift from J. Petrini, MSKCC). MEFs were prepared 
for immunofluorescence by growth on 18mm x 18mm glass cover slips. 
Lymphocytes were dropped onto slides coated with CellTak (BD Biosciences). 
Cells were fixed with methanol and incubated with primary antibody as 
indicated. 

iPOND and flow cytometry. iPOND was performed essentially as described””. 
One hundred and fifty million WT, Ptip~/~ and MII3~/~MIll4~/~ MEFs were 
labelled with 10j1M EdU (Life Technologies) and treated with HU as indicated. 
Two hundred million cells were use for iPOND experiments with ESCs. Cells were 
cross-linked with 1% formaldehyde for 10 min at room temperature, quenched 
with 0.125 M glycine and washed with PBS. For the conjugation of EdU with 
biotin azide, cells were permeabilized with 0.25% Triton X-100/ PBS, and incu- 
bated in click reaction buffer (10 mM sodium-t-ascorbate, 20 1M biotin azide 
(Life Technologies), and 2mM CuSO4) for 2h at room temperature. Cells were 
resuspended in lysis buffer (50 mM Tris-HCl, pH 8.0, and 1% SDS) supplemented 
with protease inhibitors (Roche), and chromatin was solubilized by sonication 
in a Bioruptor (Diagenode-Pico) at 4°C for 24min. After centrifugation, clari- 
fied supernatants were incubated for 1h with streptavidin-MyOne Cl beads (Life 
Technologies). Beads were washed and captured proteins were eluted by boiling 
beads in 2 x NuPAGE LDS Sample Buffer (Life Technologies) containing 200 mM 
DTT for 40 min at 95°C. Proteins were resolved by electrophoresis using NuPAGE 
Novex 4-12% Bis-Tris gels and detected by western blotting with the indicated 
antibodies. For flow cytometric analysis, asynchronous B cells were pulsed with 
10mM EdU for 20 min at 37°C and stained using the Click-IT EdU Alexa Fluor 
488 Flow Cytometry Assay Kit according to the manufacturer’s specifications 
(ThermoFisher C-10425). 

Generation of AZD2461-resistant KB2P tumours and in situ RADS51 assay. 
Brca2~/ ~;p53 l= mammary tumours were generated in K14-cre;Brca2 Hs, p53 df 
(KB2P) female mice and genotyped as described previously**. Orthotopic trans- 
plantations into WT FVB/N/OLA (F1) mice, tumour monitoring, and sampling 
were conducted as described*’. Starting from 2 weeks after transplantation, 
tumour size was monitored at least three times a week. All treatments were 
started when tumours reached a size of approximately 200 mm?. AZD2461 
(100 mg/kg) was given orally for 28 consecutive days. If tumours did not shrink 
below 100% of the initial volume, treatment was continued for another 28 days. 
AZD2461 was diluted in 0.5% w/v hydroxypropyl methylcellulose in deionized 
water to a concentration of 10 mg/ml. For testing cross-resistance, mice were 
given a single treatment regimen of topotecan (2 mg/kg intraperitoneally, days 
0-4 and 14-18) or cisplatin (Mayne Pharma, 6 mg/kg intravenously, day 0). 
Statistical analysis was performed using GraphPad Prism (log-rank Mantel- 
Cox test). The in situ RADS1 irradiation induced formation assay has been 
described"*. 

Statistics. Statistics was performed by two-tailed t-test, Mann-Whitney test or 
by log-rank Mantel-Cox test unless otherwise specified. Statistical tests were 
justified appropriate for every figure. The data were normally distributed and 
the variance between groups being statistically compared was similar. No statis- 
tical methods or criteria were used to estimate sample size or to include/exclude 
samples. The investigators were not blinded to the group allocation during the 
experiments; however, the samples were coded before analysis unless otherwise 
specified. 
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Extended Data Figure 1 | Fork degradation in Brca-deficient B of IdU versus CldU upon HU treatment in Brca2~/~ B lymphocytes 
lymphocytes is mediated by MRE11 exonuclease activity. a, b, Ratio with or without mirin, PFM39 (MRE11 exonuclease inhibitor), PC5 
of IdU versus CldU upon HU treatment in WT, Brcal~/~ (a) and WT, (DNA2 inhibitor) or WRNi pre-treatment. Numbers in red indicate the 
Brca2~/~ (b) B lymphocytes with or without mirin pre-treatment. mean + s.d. for each sample (**P < 0.05, ****P < 0.0001, Mann-Whitney 
Numbers in red indicate the mean +s.d. for each sample (**P < 0.05, test). One hundred and twenty-five replication forks were analysed for 
*®* P< 0.001, ****P < 0.0001, Mann-Whitney test). One hundred and each genotype. 


twenty-five replication forks were analysed for each genotype. c, Ratio 
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Extended Data Figure 2 | Replication fork progression rates and 
DSBs in B lymphocytes. a, Quantificative PCR analysis for Brcal, 
Brca2, Ptip and Rif1 gene deletions in WT, Brcal ~~ Breal~/— Rifl == 
Breal~’~Ptip~/~, Brca2~/~ and Brca2~‘~ Ptip~/~ primary B lymphocytes 
after infection with CRE. b, PFGE analysis for detection of DSBs in 

WT, Brca2~/~ and Brca2~‘~ Ptip~/~ B lymphocytes treated with or 
without 4mM HU for 3h. Positive control for DSBs includes treatment 
of 15 Gy irradiation and HU +ATRi treatment for 3h. Quantification 
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Extended Data Figure 3 | See next page for caption. 
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Extended Data Figure 3 | Loss of PTIP rescues fork progression and 
restart defects in Brca2-deficient B lymphocytes but does not affect 
RADS51 IRIE. a, Genomic instability measured in metaphase spreads from 
B lymphocytes derived from WT, Brcal ~~, Rifl~/~, Breal~/~ Rif1~/~ 

mice treated for 6h with 10 mM HU (*P < 0.05, unpaired t-test). Fifty 
metaphases were analysed per condition. Experiments were repeated 
three times. b, Genomic instability measured in metaphase spreads from 

B lymphocytes derived from WT, Brea2~/~, Ptip~/~, Brca2~'~ Ptip~’~ 

mice treated for 6h with 10 mM HU (*P < 0.05, unpaired t-test). Fifty 
metaphases were analysed per condition. Experiments were repeated three 
times. c, Fork progression in B lymphocytes derived from WT, Brca2 alae 
Ptip~/~, Brca2~/ Ptip~‘~ mice treated for 1h with 0.2 mM HU. The 

y axis represents the tract lengths in micrometres. Numbers in red indicate 
the mean +s.d. for each sample (****P < 0.0001, Mann-Whitney test). 
One hundred and fifty replication forks were analysed for each genotype. 
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d, Percentage of restarted replication forks in WT, Brca2 ar Ptip ai 
Brca2~‘~Ptip~/~ B cells treated for 1h with 1 mM HU followed by 90 min 
recovery. Two hundred replication forks were analysed for each genotype. 
e, Tract lengths of restarted replication forks in WT, Brca2 ae, Ptip man 
Brca2~/~Ptip~/~ B cells treated for 1h with 1 mM HU followed by 90 min 
recovery. The y axis represents the tract lengths in micrometres. Numbers 
in red indicate the mean + s.d. for each sample (NS, not significant, 
**** P< 0.0001, Mann-Whitney test). One hundred and fifty replication 
forks were analysed for each genotype. f, Western blot analysis for RAD51 
levels in WT, Ptip~/~, Brca2~/~ and Brca2~/~Ptip~/~ B cell extracts. 
Tubulin was used as loading control. g, Quantification of RAD51 foci 
formation in WT, Brca2~/~ and Brca2~/ ~Ptip ~~ B cells upon treatment 
with 5 and 10 Gy irradiation and recovery for 2, 4 and 6h (n= 150 cells 
analysed). 
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Extended Data Figure 4 | Depletion of PTIP rescues the lethality of 
Brca2-deficient ESCs. a, Western blot analysis for PTIP levels in WT 
and two different clones of Brca2~/~ ESCs electroporated with shPtip. 

b, Southern blot analysis for determination of Brca2 deletion in surviving 
clones electroporated with shPtip. Probes distinguishing the Brca2-flox 
allele (4.8 kb) (upper band) and Brca2 KO allele (2.2 kb) (lower band) 
were used. *Surviving ESC clones with Brca2 deletion and simultaneous 
downregulation of PTIP (12/96 Brca2-deleted colonies were found 

with shPtip1 and 3/60 colonies were found with shPtip2). Genotyping 
was confirmed by PCR (Fig. 2b). c, Top, representative FACS profiles 

of WT and Brca2~/~/shPtip ESCs electroporated with either pDR-GFP 
plasmid only (control) or pDRGFP and I-Scel expressing vector for 48 h. 
Gene conversion of the pDR-GFP construct by HR is determined by 

the percentage of GFP-positive cells (FL1, green-detection filter; FL2, 
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red-detection filter). Bottom, quantification of the percentage of GFP- 
positive cells in WT and Brca2~/~/shPtip ESCs across three independent 
experiments. NS, not significant, *P <0. 05, unpaired t-test. d, Sister 
chromatid exchange (SCE) analysis in WT and Brca2*3°X hypomorphic 
ESCs. Twenty metaphases were analysed per condition; experiments were 
repeated three times. e, WT and Brca2**°* hypomorphic ESCs were 
preincubated with mirin, EdU-labelled for 15 min and treated with 

4mM HU for 2h. Proteins associated with replication forks were isolated 
by iPOND and detected by western blotting with the indicated antibodies. 
f, SCEs in WT, Brca2~’~ and Brca2~/~ Ptip ~~ B cells either untreated 

or treated overnight with 11M PARPi or with 0.5 1M cisplatin (NS, not 
significant, *P < 0.05, **P < 0.001, unpaired t-test). Twenty metaphases 
were analysed per condition; experiments were repeated three times. 
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Extended Data Figure 5 | PTIP localizes to sites to DNA replication 
independently of DSBs. a, WT and 53Bp1~/~ MEFs were retrovirally 
infected with a GFP-tagged PTIP construct. Cells were then irradiated 
with 10 Gy and allowed to recover. Co-localization of \-H2AX (red) and 
PTIP (green) was assessed. Adjoining graph quantifies the percentage of 
cells with \-H2AX foci co-localizing with PTIP upon irradiation (n= 150 
cells analysed). Experiments were repeated three times. b, Western 

Blot analysis for endogenous and overexpressed PTIP levels in WT and 
53Bp1~'~ MEFs retrovirally infected with a GFP-tagged PTIP construct 
(GFP-PTIP). c, Quantification of the percentage of pan-nuclear y-H2AX- 
positive cells with PTIP foci in WT and 53Bp1~/~ MEFs upon treatment 
with HU (related to Fig. 3a) (n= 150 cells analysed). Experiments were 
repeated three times. d, Quantification of the percentage of cells with PTIP 
foci co-localizing with PCNA in WT and 53Bp1~/~ MEFs in late S phase 
(related to Fig. 3b) (n = 150 cells analysed). Experiments were repeated 
three times. e, Representative immunofluorescence images of PCNA and 
53BP1 co-staining in irradiated WT cells (10 Gy) or in late S-phase sites. 
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White arrows indicate that the few 53BP1 foci observed in late S-phase 
cells do not co-localize with PCNA. (1 =50 cells analysed). Experiments 
were repeated three times. f, Representative immunofluorescence images 
of WT and Ptip~/~ MEFs treated with 4 mM HU for 2 h and analysed 

for ssDNA (BrdU) and MRE11 co-localization. Bottom panels shows the 
quantification of BrdU-positive cells (left) and the percentage of MRE11 
co-localization in BrdU-positive cells (right) upon HU treatment in WT 
and Ptip-/~ MEFs. (n= 150 cells analysed). Experiments were repeated 
three times. g, Quantification of the percentage of cells with MRE11 foci 
co-localizing with \-H2AX upon irradiation treatment (10 Gy) in WT and 
Ptip~/~ MEFs (related to Fig. 3e) (n = 150 cells analysed). Experiments 
were repeated three times. h, Cell cycle profiles in WT and Ptip-/~ MEFs 
as measured by the incorporation of EdU (y axis) vs. DAPI (x axis). i, 
iPOND coupled to SILAC Mass-Spectrometry analysis for PTIP, H4 and 
RPA enrichment at stalled forks in 293T cells upon 3 mM HU treatment 
for 10 min and 4h (ref. 22). The y axes represent the relative abundance of 
the indicated proteins on a log scale. 
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Extended Data Figure 6 | See next page for caption. 
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Extended Data Figure 6 | MLL3/4 promotes replication fork 
degradation in Brca2-deficient cells. a, Top, schematic of the retroviral 
PTIP mutant constructs used to identify the domain of PTIP involved in 
driving replication fork degradation. Different BRCT domains in PTIP are 
numbered and Q represents the glutamine rich region between the second 
and the third BRCT domains. Bottom, ratio of IdU versus CldU upon 

HU treatment of Brca2~/~ Ptip~/~ B lymphocytes retrovirally infected 
with either EV, FL, W165R, W663R and Del BRCT 5-6 PTIP-mutant 
constructs and sorted for GFP or mCherry expression. Numbers in red 
indicate the mean + s.d. for each sample (**P < 0.01, ****P < 0.0001, 
Mann-Whitney test). One hundred and twenty-five replication forks 
were analysed for each condition. b, WT and MII3/4 ~/~ MEEs were EdU- 
labelled for 15 min and treated with 4 mM HU for 4h. Proteins associated 
with replication forks were isolated by iPOND and detected by western 
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blotting with the indicated antibodies. c, Quantification of the percentage 
of cells with MRE11 foci co-localizing with PCNA in late S phase in 

WT and MLL3/4~/~ MEFs (n= 150 cells analysed). Experiments were 
repeated three times. d-f, Ratio of IdU versus CldU upon HU treatment 
in WT, Brcal~/~, Brca2~/~ MIl4~/~, MIl4-SET ‘~, Brcal~/~Mll4‘-, 
Brea2~/~MIl4~’~ and Brcal~/~ MIl4-SET~/~ B cells. Numbers in 

red indicate the mean + s.d. for each sample (NS, not significant, 

we D <.0001, Mann-Whitney test). At least 125 replication forks were 
analysed for each genotype. g, Genomic instability measured in metaphase 
spreads from splenic B cells derived from WT, Brca2~‘~, MIl4~/-, 
Brca2~/~Mll4~/~ B cells treated overnight with 1 1M PARPi or with 0.5 1M 
cisplatin (**P < 0.01, ***P < 0.001, unpaired t-test). Fifty metaphases 
were analysed per condition. Experiments were repeated three times. 
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instability. a, Quantification of RAD51 foci formation in WT, Brcal / from WT, Brcal~/~ and Rifl /~ mice treated overnight with 1 1M 
and Rif ’~ B cells. Cells were treated with 10 Gy and harvested 4 h after PARPi (b) or with 0.5 1M cisplatin (c) (NS, not significant, **P < 0.01, 
irradiation (NS, not significant, *P< 0.05). At least 100 cells were analysed *** P< 0.001, ****P < 0.0001, unpaired t-test). Fifty metaphases were 
per condition; experiments were repeated three times. b, c, Genomic analysed per condition; experiments were repeated three times. 
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Extended Data Figure 8 | See next page for caption. 
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Extended Data Figure 8 | Multiple mutations can cause resistance 
to DNA-damaging agents in Brca-deficient cells. a, b, Difference in 
progression-free survival (PFS) of BRCA2- and BRCA 1-mutated ovarian 


serous adenocarcinoma patients with standard platinum-based regimens. 


Data were obtained from the TCGA project. Patients were separated 

into PTIP low- or high-expression on the basis of the 33rd percentile of 
PTIP expression z-scores. The difference between the PFS of PTIP-low 
versus PTIP-high was assessed by univariate log-rank P value (P < 0.072 
and P < 0.032 ina and b, respectively). Analysis included 38 tumours 
with BRCA1 mutations and 34 tumours with BRCA2 mutations out of 
316 high-grade serous ovarian cancers that underwent whole-exome 
sequencing. PFS curves for PTIP-low and PTIP-high expressing tumours 
were generated by the Kaplan-Meier method. All reported P values are 
two-sided. c, Western blot analysis for CHD4 and PTIP levels in PEO1 
cells infected with shNSC and shCHD4. Tubulin was used as loading 
control. d, Cell cycle profiles in PEO1 cells infected with shNSC and 
shCHD4 as measured by the incorporation of EdU (y axis) versus DAPI 
(x axis). e, Immunostaining for MRE11 and PCNA in PEOI cells infected 


with shNSC and shCHD4 upon treatment with 4 mM HU. Bottom, 
quantification for MRE11 recruitment upon HU treatment. At least 

100 cells were analysed per condition; experiments were repeated three 
times. f, Western blot analysis for CHD4 and MRE11 levels in PEO] cells 
infected with shNSC and shCHD4. Actin was used as loading control. 

g, Percentage of EdU-positive cells was analysed 20h after Brca2~/~ 

B cells were treated with mirin alone, mirin+PARPi or mirin+ cisplatin 
(NS, not significant). EdU was pulsed for 20 min before FACS analysis. 
Experiments were repeated three times. h, Genomic instability measured 
in metaphase spreads from B cells derived from Brca2~/~ mice pretreated 
with 251M mirin for 2 h followed by overnight treatment with 1 1M PARPi 
or 0.5 1M cisplatin (NS, not significant, *P < 0.05, **P < 0.001, unpaired 
t-test). Fifty metaphases were analysed per condition. Experiments 

were repeated three times. i, Quantification of RAD51 foci formation in 
WT, Breal~’-, Parpl ~~ and Brcal ~~Parp! ~~ B cells treated with 10 

Gy irradiation and harvested 4h after treatment. At least 100 cells were 
analysed per condition; experiments were repeated three times. 
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Extended Data Figure 9 | See next page for caption. 
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Extended Data Figure 9 | Brca2-deficient tumours acquire PARPi 
resistance without restoration of RAD51 foci formation. a, Schematic 
depicting the conditional BRCA2 allele of the KB2P (K14CRE;Brea2l,p53!"") ) 
spontaneous tumour model. b, Outline of the PARPi intervention 

study. A spontaneous BRCA2-/p53-deficient tumour was generated and 
re-transplated into syngeneic WT mice. When the tumours reached 

200 mm, they were treated either with vehicle or PARPi AZD2461. 

c, PARPi response curve of the KB2P tumour (relative tumour volume 
(rtv) versus days). The treatment for 28 consecutive days was started when 
the tumour reached 200 mm? (rtv = 100%). In response to the treatment, 
the tumour shrank but eventually grew back. When it reached 100% 
relative tumour volume, the treatment was repeated (as indicated with red 
arrows) for another 28 days. This regime was continued until the tumour 
became resistant to PARPi (black arrow). d, The stability of acquired 
resistance of the KB2P tumour was confirmed by re-transplanting 
matched naive and resistant tumours and treating animals either with 


vehicle or AZD2461 (only one 28-day cycle). Kaplan-Meier survival 
curve indicates that resistant tumours did not respond to the AZD2461 
treatment, while naive tumours exhibited high sensitivity, indicative of a 
stable genetic mechanism of resistance. e, f, Irradiation-induced RAD51 
foci were detected by immunofluorescence in the KB2P donor: RAD51 
foci formation was undetectable in naive and resistant tumours, suggesting 
that HR restoration is not an underlying mechanism of PARPi resistance. 
Spontaneous tumours from K14 Cre; pot mice treated with irradiation 
were used as positive control for RAD51 foci. Unirradiated K14 Cre; pst 
cells were used as a negative control. g, Replication fork progression rates 
measured by tract lengths in micrometres of CldU (red) and IdU (green) 
in PARPi-naive or PARPi-resistant tumours. Numbers in red indicate the 
mean + s.d. for each sample. One hundred and twenty-five replication 
forks were analysed for each condition. h, Western blot analysis for PTIP 
and MRE11 levels in PARPi-naive or PARPi-resistant tumours. Tubulin 
was used as loading control. 
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(identified by PCNA foci (green)). DNA was stained with DAPI (blue). 
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Relativistic reverberation in the accretion flow of a 


tidal disruption event 


Erin Kara), Jon M. Miller*, Chris Reynolds)? & Lixin Dai? 


Our current understanding of the curved space-time around 
supermassive black holes is based on actively accreting black holes, 
which make up only ten per cent or less of the overall population. 
X-ray observations of that small fraction reveal strong gravitational 
redshifts that indicate that many of these black holes are rapidly 
rotating’; however, selection biases suggest that these results are not 
necessarily reflective of the majority of black holes in the Universe’. 
Tidal disruption events, where a star orbiting an otherwise dormant 
black hole gets tidally shredded and accreted onto the black hole’, 
can provide a short, unbiased glimpse at the space-time around 
the other ninety per cent of black holes. Observations of tidal 
disruptions have hitherto revealed the formation of an accretion 
disk and the onset of an accretion-powered jet*®, but have failed 
to reveal emission from the inner accretion flow, which enables the 
measurement of black hole spin. Here we report observations of 
reverberation?” arising from gravitationally redshifted iron Ka 
photons reflected off the inner accretion flow in the tidal disruption 
event Swift J1644+57. From the reverberation timescale, we 
estimate the mass of the black hole to be a few million solar masses, 
suggesting an accretion rate of 100 times the Eddington limit or 
more!3, The detection of reverberation from the relativistic depths 
of this rare super-Eddington event demonstrates that the X-rays 
do not arise from the relativistically moving regions of a jet, as 
previously thought>"4. 

Swift J1644+-57 was detected by the Swift Burst Alert Telescope on 
28 March 2011 as it reached an isotropic X-ray luminosity greater than 
10*8 ergs~!. Follow-up radio, optical and infrared observations located 
the position of the source at the centre ofa compact, non-interacting, 
star-forming galaxy (redshift z=0.3534). The spatial coincidence with 
a normal galaxy and the decay in luminosity over time t as Lx t°/ 
led to the conclusion that this transient was a tidal disruption event 
(TDE)**!°, The peak luminosity exceeds the Eddington limit by four 
orders of magnitude (for a 10°-solar-mass black hole), which suggests 
that the X-ray emission is highly anisotropic, possibly originating from 
a relativistic jet pointed in our line of sight*. These energetics argu- 
ments suggesting a relativistic jet were bolstered by the detection of 
a strong radio afterglow at the nucleus of this previously inactive gal- 
axy”. Although the broadband observations all suggest that this TDE 
resulted in super-Eddington accretion and the launching of jets, the 
powering of the jets (whether magnetically driven or radiation-pres- 
sure-driven) and the role of geometric beaming remains unknown!*'*, 

Follow-up X-ray observations of Swift J1644+-57 were taken soon 
after the initial discovery, and found the source to be highly variable, 
further suggesting accretion onto a compact object. A 200s quasi- 
periodic signal was reported in the 2-10 keV light curve of the bright- 
est two observations at >99.99% and >99.73% confidence’”. Although 
the exact nature of the quasiperiodic oscillation is unknown, it does 
confirm the formation of an accretion disk just 10 days after the initial 
TDE. It was suggested” that this quasiperiodicity could originate from 


orbits at the inner edge of the accretion disk, resonances in the relativ- 
istic jet or variability in a turbulent accretion flow. Here, we reanalyse 
the early XMM-Newton and Suzaku observations to look at properties 
of the variability at all other timescales (Extended Data Table 1). We 
use a new technique, X-ray reverberation mapping, which combines 
spectral and timing techniques to understand the physical nature of 
the rapid X-ray variability. 

We proceed by extracting XMM-Newton light curves in narrow 
energy bands from 0.3 keV to 10 keV (observed frame) and then use 
Fourier techniques to extract the relative time delays between these 
bands as a function of the temporal frequency (see Methods). We find 
a coherent soft excess lag in the frequency range 0.0002-0.001 Hz 
(Extended Data Fig. 1), and by averaging the interband time delays over 
these frequencies produce the lag—energy spectrum shown in Fig. 1. 
We find that the 5.5-8 keV rest-frame emission is delayed by ~100s 
relative to the 4-5 keV and 8-13 keV emission (the same time delay 
can be seen directly between the rest-frame 5.5-8 keV and 8-13 keV 
light curves in Extended Data Fig. 2). The iron K lag is found with 
>99.9% confidence (see Methods for details). We also find the same 
iron K lag in the Suzaku observation taken 10 days earlier at >99.8% 
confidence (Extended Data Figs 3 and 4). The asymmetry of the line is 
confirmed at >98% confidence (Extended Data Fig. 5, Extended Data 
Table 2). The iron K lag is characteristic of emission that reverberates 
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Figure 1 | Lag-energy spectrum of Swift J1644+57. The lag-energy 
spectrum with 1c error bars in the frequency range (2-10) x 10-*Hz. 
The zero point lag has been shifted so that zero lag corresponds to the 
emission that varies first and larger lags correspond to the correlated 
emission that varies later. This lag—energy spectrum shows that the 
emission from ~4-5 keV and 8-13 keV (primary continuum-dominated 
bands) vary first, and the iron line from ~7-8 keV responds ~100s later, 
consistent with short-timescale reverberation off an ionized accretion 
disk. The asymmetric profile of the iron K lag shows that the emission is 
gravitationally redshifted and therefore much of the emission originates 
close to the central massive black hole. 
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Figure 2 | The energy spectrum of Swift J1644+57. The time-integrated 


energy spectrum in normalized counts units of the observation shown in 

Fig. 1. The top panel shows the data in black and the best-fit power-law 
model for 2-10 keV (observer’s frame) in red. Errors are at the 1a level. The 
fit includes intrinsic absorption of ~10” cm ~’, similar to previous analyses’’. 
The bottom panel shows the ratio of the data to that of the absorbed 
power-law model. The green line indicates a ratio of one, corresponding to 
agreement between the model and the data. The residuals show a clear excess 
at 8 keV, the same energy as the peak of the lag—energy spectrum in Fig. 1. 


off the inner accretion flow and is re-emitted as a relativistically broad- 
ened iron Ka line. Such short-timescale lags between the direct emis- 
sion and the reprocessed emission indicate that the emitting region is 
very small, within tens of gravitational radii of the central black hole. 
The asymmetric profile of the iron K lag indicates that the iron Ka 
emission line is gravitationally redshifted from within 10 gravitational 
radii. Relativistic reverberation lags associated with the broad iron Ka 
line are commonly found in rapidly varying local Seyfert galaxies'*-”° 
(see Extended Data Fig. 6 for examples). 

Time lag analysis is a powerful technique because it disentangles the 
variable and correlated emissions from other emissions in the time- 
integrated energy spectrum. We examined the time-integrated energy 
spectrum in detail to search for further evidence of the iron K emission 
line. The resulting spectrum is shown in Fig. 2 compared with the best- 
fit absorbed power law for 2-10 keV. There is an excess in the energy 
spectrum at rest-frame 8 keV, the same energy as the peak in the lag- 
energy spectrum. To estimate the strength of the spectral feature, we fit 
it with a simple Gaussian function and find the equivalent width (that 
is, the area of the Gaussian above the continuum) to be 60 + 10 eV. If 
this line corresponds to highly ionized iron Ka emission at 6.97 keV, 
then the emission is blueshifted by a line-of-sight velocity of ~0.15c. 

With our new timing results, we estimate the mass of the central 
black hole by comparing the lag amplitude and lag frequency of Swift 
J1644+57 to Seyfert galaxies with known reverberating broad iron 
lines. The time lag of the broad iron line is determined by the light 
travel time between the continuum-emitting region and the X-ray 
photosphere. The temporal frequency at which the iron K reverbera- 
tion lag is found is determined by how fast the continuum is varying. 
Both the light travel time and the variability timeframe are expected 
to scale with the mass of the central black hole. These two correlations 
have been seen in Seyfert galaxies for which X-ray reverberation is 
detected’®”". In Fig. 3, we plot (as grey hexagons) the known corre- 
lations of the rest-frame temporal frequency of the iron K lag versus 
the black hole mass (Fig. 3a) and the rest-frame amplitude of the iron 
K lag versus black hole mass (Fig. 3b) for several Seyfert galaxies. The 
red solid lines in both panels show the best-fit line to these correlations 
using an orthogonal-distance regression fitting procedure to account 
for the error in both the x and y variables. For clarity, we do not plot 
the error bars on the mass in the figure, but they were included in the 
model fitting (see Extended Data Table 3 for the names and references 
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Figure 3 | Comparison of lags in Swift J1644+57 with other Seyfert 
galaxies. a, The rest-frame frequency range of the iron K lag versus black 
hole mass (in units of solar masses, Mo) for several Seyfert galaxies 

(grey hexagons; error bars indicate the frequency range). The solid red 
line indicates the best-fit linear model (in log-log space) with 1o errors 
(red dashed lines). The blue square and error bars indicate the rest-frame 
frequency range of the iron K lag in Swift J1644+-57, and the blue shaded 
region shows the lo range in mass suggested by this correlation. b, As 

in a except the correlation is with the amplitude of the lag (with 1¢ error 
bars) versus black hole mass. The amplitude of the lag in Swift J1644+57 
is calculated between 4—-5.5 keV and the peak of the line at 6.5-8 keV. Both 
correlations suggest a black hole mass of a few million solar masses. 


of individual sources). The blue squares indicate the redshift-corrected 
temporal frequency (Fig. 3a) and lag amplitude (Fig. 3a) for our TDE 
source placed at the black hole mass suggested by the best-fit line. 
Both the lag amplitude and frequency correlations suggest a mass of 
a few million solar masses (as shown by the blue shaded region), con- 
firming that this source achieves isotropic luminosities exceeding the 
Eddington limit by a factor of >10. The TDE fall-back rate for such a 
black hole mass is sufficient to produce a luminosity of ~10* ergs"! 
assuming a radiative efficiency of ~10%. See Methods for a discussion 
on the spread in this correlation, and the possible biases in mass that 
such a spread may imply. 

The detection of iron K reverberation in this super-Eddington 
accretion event puts strong constraints on the geometry of the flow 
and the launching mechanism of the jet. It definitively shows that the 
optically thick accretion flow is being irradiated by a variable X-ray 
source. Although blazars sometimes show soft lags”, there is no 
evidence of iron K reflection in either their lags or their time- 
integrated energy spectra. Therefore, the detection of iron K reverbera- 
tion in Swift J1644-++57 strongly disfavours a model in which the X-ray 
emission is produced by a highly relativistic (Lorentz factor >10), mag- 
netically powered, blazar-like jet, as previously discussed’. Following 
recent theoretical developments in our understanding of super- 
Eddington accretion flows”*-*°, we believe that we are looking down 
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Figure 4 | Schematic of reverberation off a super-Eddington accretion 
flow. Swift J1644+-57 is thought to be a super-Eddington accreting source, 
with a very thick, face-on accretion disk (blue) and a relativistic radio jet 
(orange). We suggest that the iron K lag originates from the light travel time 
between a flaring X-ray source and an irradiated outflowing funnel wall. 

In this schematic of the cross-section of the thick disk, the observer is 
looking down the funnel into the relativistic depths close to the central black 
hole. Variable X-ray continuum emission (lightning bolts) follows several 
light paths (two of which are shown as red arrows). The continuum emission 
reflects off the walls of the funnel and into the observer's line of sight. The 
strong blueshift of the iron lag is due to special relativistic Doppler shifts, 
and the asymmetric red wing is due to gravitational redshift from the strong 
gravity of the black hole. The blue arrows represent the dynamics in the disk: 
the accretion flow rotates around the central black hole and the walls of the 
funnel are outflowing at ~0.1c-0.5c. 


the polar funnel of a geometrically thick, radiation-supported flow 
(see Fig. 4 for a schematic). Therefore, the blueshift of the peak in the 
iron line is probably not due to disk rotation (requiring an edge-on 
disk), but rather due to the Doppler shift from outflowing material on 
the funnel wall that has been radiatively accelerated to 0.1c-0.5c. In 
such a flow, a substantial part of the beaming is geometric (radiation 
preferentially escapes down the funnel) and the slow jet is probably 
launched by radiation pressure (see Methods and Extended Data Fig. 7 
for an application of this simple model). 

Relativistic reverberation off an optically thick super-Eddington 
accretion flow tells us that much of the X-ray emission originates close 
to the event horizon. In principle, therefore, the X-ray emission can 
be used to measure the spin of the central black hole. However, our 
current models for measuring black hole spin from the broadening of 
the iron K emission assume a geometrically thin disk extending to the 
innermost stable circular orbit (ISCO), an assumption that is not likely 
to be relevant for this super-Eddington source, which has a very thick 
accretion disk that possibly extends beyond the ISCO to the marginally 
bound orbit’””*. Therefore, at this time we cannot claim an estimate 
of the black hole spin; but, with future developments in modelling 
such flows, relativistic reverberation from TDEs offers a potentially 
powerful technique for measuring spin—not only in the 10% of black 
holes that are persistently accreting, but also in the 90% of dormant 
black holes in the Universe. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 


Received 16 December 2015; accepted 4 April 2016. 
Published online 22 June 2016. 


1. Reynolds, C. S. The spin of supermassive black holes. Class. Quant. Grav. 30, 
244004 (2013). 

2. Vasudevan, R. V., Fabian, A. C., Reynolds, C. S., Dauser, T. & Gallo, L. C. A selection 
effect boosting the contribution from rapidly spinning black holes to the cosmic 
X-ray background. Mon. Not. R. Astron. Soc. 458, 2012-2023 (2016). 

3. Rees, M. J. Tidal disruption of stars by black holes of 10°-108 solar masses in 
nearby galaxies. Nature 333, 523-528 (1988). 

4. Burrows, D. N. et al. Relativistic jet activity from the tidal disruption of a star by 
a massive black hole. Nature 476, 421-424 (2011). 

5. Zauderer, B. A. et al. Birth of a relativistic outflow in the unusual gamma-ray 
transient Swift J164449.34+573451. Nature 476, 425-428 (2011). 


390 | NATURE | VOL 535 | 21 JULY 2016 


6. Bloom, J. S. et al. A possible relativistic jetted outburst from a massive 
black hole fed by a tidally disrupted star. Science 333, 203-206 (2011). 

7. Miller, J. M. et al. Flows of X-ray gas reveal the disruption of a star by a massive 
black hole. Nature 526, 542-545 (2015). 

8. Komossa, S. Tidal disruption of stars by supermassive black holes: status of 
observations. J. High Energy Astrophys. 7, 148-157 (2015). 

9. Fabian, A. C. et al. Broad line emission from iron K- and L-shell transitions in 
the active galaxy 1H 0707-495. Nature 459, 540-542 (2009). 

10. Zoghbi, A., Fabian, A. C., Reynolds, C. S. & Cackett, E. M. Relativistic iron K 
X-ray reverberation in NGC 4151. Mon. Not. R. Astron. Soc. 422, 129-134 
(2012). 

11. Cackett, E. M. et a/. Modelling the broad Fe Ka reverberation in the AGN 
NGC 4151. Mon. Not. R. Astron. Soc, 438, 2980-2994 (2014). 

12. Uttley, P, Cackett, E. M., Fabian, A. C., Kara, E. & Wilkins, D. R. X-ray reverberation 
around accreting black holes. Astron. Astrophys. Rev. 22, 72 (2014). 

13. Evans, C. R. & Kochanek, C. S. The tidal disruption of a star by a massive 
black hole. Astrophys. J. 346, L13-L16 (1989). 

14. Tchekhovskoy, A. Metzger, B. D., Giannios, D. & Kelley, L. Z. Swift J1644+57 
gone MAD: the case for dynamically important magnetic flux threading the 
black hole in a jetted tidal disruption event. Mon. Not. R. Astron. Soc. 437, 
2744-2760 (2014). 

15. Levan, A. J. et al. An extremely luminous panchromatic outburst from the 
nucleus of a distant galaxy. Science 333, 199-202 (2011). 

16. Coughlin, E. R. & Begelman, M. C. Hyperaccretion during tidal disruption 
events: weakly bound debris envelopes and jets. Astrophys. J. 781, 82 (2014). 

17. Reis, R. C. et al. A 200-second quasi-periodicity after the tidal disruption of a 
star by a dormant black hole. Science 337, 949-951 (2012). 

18. Zoghbi, A. et al. Discovery of Fe Ka X-ray reverberation around the black holes 
in MCG-5-23-16 and NGC 7314. Astrophys. J. 767, 121 (2013). 

19. Kara, E. et al. Discovery of high-frequency iron K lags in Ark 564 and Mrk 335. 
Mon. Not. R. Astron. Soc. 434, 1129-1137 (2013). 

20. Alston, W. N. et a/. Discovery of a ~2-h high-frequency X-ray QPO and iron Ka 
reverberation in the active galaxy MS 2254.9-3712. Mon. Not. R. Astron. Soc. 
449, 467-476 (2015). 

21. De Marco, B. et al. Discovery of a relation between black hole mass and 
soft X-ray time lags in active galactic nuclei. Mon. Not. R. Astron. Soc. 431, 
2441-2452 (2013). 

22. Ravasio, M. et al. Observing Mkn 421 with XMM-Newton: the EPIC-PN 
point of view. Astron. Astrophys. 424, 841-855 (2004). 

23. Sadowski, A. & Narayan, R. Powerful radiative jets in supercritical accretion 
discs around non-spinning black holes. Mon. Not. R. Astron. Soc. 453, 
3213-3221 (2015). 

24. Jiang, Y.-F. Stone, J. M. & Davis, S. W. A global three-dimensional radiation 
magneto-hydrodynamic simulation of super-Eddington accretion disks. 
Astrophys. J. 796, 106 (2014). 

25. McKinney, J. C., Dai, L. & Avara, M. J. Efficiency of super-Eddington 
magnetically-arrested accretion. Mon. Not. R. Astron. Soc. 454, L6-L10 
(2015). 

26. Abramowicz, M. A., Czerny, B., Lasota, J. P. & Szuszkiewicz, E. Slim accretion 
disks. Astrophys. J. 332, 646-658 (1988). 

27. Abramowicz, M. A. et al. Leaving the innermost stable circular orbit: the inner 
edge of a black-hole accretion disk at various luminosities. Astron. Astrophys. 
521, A15 (2010). 

28. Krolik, J. H. & Hawley, J. F. Where is the inner edge of an accretion disk around 
a black hole? Astrophys. J. 573, 754-763 (2002). 


Acknowledgements E.K. thanks A. Zoghbi, M. C. Miller, F. Tombesi, E. Miller 
and L. Denby for discussions. E.K. also thanks the Hubble Fellowship 
Program for support under grant number HST-HF2-51360.001-A from the 
Space Telescope Science Institute, which is operated by the Association 

of Universities for Research in Astronomy, Incorporated, under NASA 
contract NAS5-26555. J.M.M. acknowledges N. Schartel and XMM-Newton 
for executing target-of-opportunity observations of Swift J1644+57. 

C.R. acknowledges support from NASA under grant number NNX14AF86G. 
L.D. thanks J. McKinney for discussions. L.D. acknowledges support from 
NASA/NSF/TCAN (NNX14AB46G), NSF/XSEDE/TACC (TG- PHY 120005) and 
NASA/Pleiades (SMD-14-5451). This work is based on observations made 
with XMM-Newton, a European Space Agency (ESA) science mission with 
instruments and contributions directly funded by ESA member states and the 
US (NASA) and the Suzaku satellite, a collaborative mission between the 
space agencies of Japan (JAXA) and the US (NASA). 


Author Contributions E.K. led the XMM-Newton and Suzaku time lag analysis, 
simulations, interpretation of the results and manuscript preparation. 

J.M.M. performed the XMM-Newton spectral analysis and contributed to 

the interpretation of the results. C.R. developed the analytical toy model for 
reverberation in a super-Eddington flow and contributed to the interpretation 
of the results. L.D. had the idea of examining X-ray time lags in a TDE and 
contributed to the interpretation of the results. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial 
interests. Readers are welcome to comment on the online version of the paper. 
Correspondence and requests for materials should be addressed to 

E.K. (ekara@astro.umd.edu). 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


METHODS 

XMM-Newton data reduction. XMM-Newton”? observed Swift J1644-+57 with 
the EPIC-pn camera” for 25 ks in large-window imaging mode with a medium 
optical block filer. We reduced the data using the XMM-Newton Science Analysis 
System (SAS version 14.0.0) and the newest calibration files. The events were 
filtered with conditions PATTERN < 4 and FLAG=0. See Extended Data Table 1 
for a summary of the observations used in this analysis. 

For the spectral analysis we used an annulus source region, excising the cen- 
tral pixels as the spectrum is clearly affected by pile-up. We excised the central 
12 arcsec of the circular source region of 60 arcsec. We could not use the same 
source region for the detailed timing analysis because this source region spanned 
two chips that have slightly different readout times that affect the timing analysis. 
We therefore used a smaller circular source region of 35 arcsec contained on a 
single chip. For the timing analysis we did not need to excise the central pixels 
because pile-up does not affect the measurement of the lag. Pile-up is a random 
process and therefore does not contribute to the correlated variability between 
energy bands from which the lags are determined*’. 

For both the spectral and timing analyses we used a circular background region 
with a radius of 70 arcsec. There were no strong background flares during the 
observation and the 0.3-10 keV background count rate (corrected for region size) 
never exceeded 0.4 counts per second, ~4% of the source count rate. The response 
matrices were produced using the rmfgen and arfgen tools in SAS. 

We used the epiclccorr tool to produced background subtracted light curves 
with 10 s bins in several small energy bands in the range 0.3-10 keV. The 
0.3-10keV background-subtracted light curve is shown in Extended Data Fig. 2. 
The inset of Extended Data Fig. 2 shows a zoom-in of the light curves during 
the brightest, most variable part of the observation. We specifically examine 
the 4-6 keV light curve (red) and the 6-10 keV light curve (blue). By examining 
closely the light curves, one can see that on average the 4-6 keV band lags with 
respect to the 6-10 keV band (as shown in the lag-energy spectrum of Fig. 1). 
Although a lag is evident even in the time domain, the Fourier domain offers a 
clearer look at the lags on particular timescales. 

We attempted to extract lags and spectra from an XMM-Newton observation 

taken shortly after the original one. Unfortunately, this observation was heavily 
affected by background flares, as noted previously’”. Only 4 ks of the entire 21 ks 
exposure had a background level lower than the suggested maximum rate of 
0.4 counts per second in the 10-12 keV background light curve. No iron K line 
or lag was detected, but given the poor quality of the data we cannot say whether 
this non-detection is intrinsic to the source. 
Fourier time lag analysis of the XMM-Newton data. The Fourier time lag 
analysis follows the standard procedure’. In short, for every energy bin shown 
in the lag-energy spectrum (Fig. 1), the time lag is computed between the light 
curve in that particular bin of interest and the large reference-band light curve 
from 0.3-10 keV. We remove the bin of interest from the reference band so we do 
not have correlated errors. As the lag is measured in each bin of interest relative to 
essentially the same reference band, the lag-energy spectrum displays the relative 
lag between different energy bins. 

To compute the lag we take the Fourier transform of the light curve and the 
reference-band light curve and compute the cross spectrum by multiplying one 
band and the complex conjugate of the other band. The argument of the cross 
spectrum is the phase difference between the two bands. For the lag-energy 
spectrum, we focus on the average phase lag over a particular frequency range. 
We convert the phase lag into a time lag by dividing by the midpoint frequency. 
In practice, the lag—energy spectrum is read from bottom to top—the bins with 
the smallest lag respond first, whereas the bins with larger lags are delayed with 
respect to the other bins. 

We choose the frequency range in the usual way, on the basis of the lag- 
frequency spectrum between rest-frame 0.3-1 keV and 1-4keV (Extended Data 
Fig. 1). These bands are chosen because the 0.3-1 keV band is typically the band 
dominated by the soft excess and the 1-4 keV band is typically the band most 
dominated by the continuum”!”. We find a negative lag (that is, the soft band 
lagging the hard band) from (2-10) x 10-“Hz and therefore our lag—energy 
analysis is focused on this particular frequency range. 

The power of the cross-spectral Fourier technique is that we pick out variability 
features that are correlated between energy bands. This, in principle, allows us to 
pick out components that may be lost in the time-averaged spectrum. 

Suzaku data reduction. We process the unfiltered event files for each of the XIS 
CCDs. The source and background regions were ~3.5 arcmin in radius. XSELECT 
was used to extract the spectral products, and medium-resolution responses were 
generated using XISRESP. The front- and back-illuminated light curves were com- 
bined for the time lag analysis. We extracted light curves between 1-8keV in 1 keV 
energy bins and 128s time bins. We include the 1.8-2.2 keV band where there are 
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known calibration uncertainties because they do not affect the time lag analysis. 
The signal was too low in the 8-10 keV bins, and the maximum-likelihood analy- 
sis could not converge. We therefore exclude these highest-energy bins from the 
analysis. The total Suzaku exposure is 52 ks. 

The Suzaku spectrum does not show a clear iron K emission line. We find an 

upper limit of 7 eV on an iron line of equivalent width at the same energy as the 
XMM-Newton spectrum. We closely examined the spectrum above 3 keV, ignor- 
ing the Au edge at ~2.3 keV, which resulted in additional residuals to the fit. We 
find that the 3-10 keV spectrum shows slight hardening above ~6 keV. This hard- 
ening is not likely to be due to pile-up because the observation was taken in the 
one-quarter window mode and the observed count rate is well below the nominal 
pile-up limit of 100 incident counts per second for the one-quarter window mode. 
Furthermore, excising the central 20 arcsec of the source region does not remove 
the hardening. We checked the Suzaku Hard X-ray Detector (HXD) spectrum 
and found that the excess hardening was also present above 20 keV. Extended 
Data Fig. 3 shows the ratio of the Suzaku X-ray Imaging Spectrometer (XIS) and 
HXD data to an absorbed power-law fit to the 3-6 keV range (this includes a 
normalization cross-calibration constant of 1.16 between front-illuminated XIS 
and HXD, as suggested in the Suzaku ABC guide). This ratio plot displays the 
hard tail above ~6 keV and the excess above 20 keV. The excess hardening and 
curvature above 20 keV are reminiscent of the Compton hump. The presence of a 
Compton hump without the accompanying detection of an iron line is puzzling, 
but could suggest the presence of an iron line that is so severely broadened that it 
is rendered undetectable. This in turn would suggest that the reflection features 
were produced much closer to the black hole (and that our line of sight extended 
down to this region) during the Suzaku observation compared with the later 
XMM-Newton observation. 
Maximum likelihood analysis of Suzaku data. As Suzaku is in a low-Earth orbit 
there are gaps in the observations approximately every 3,000s. The timescales 
relevant to this system are longer than 3,000 and therefore we cannot rely on 
the Fourier method. Instead we chose the maximum-likelihood method*?, which 
is used to analyse the lags in NuSTAR observations***»; for further details of the 
method and Monte Carlo simulations that compare it with the Fourier method 
see the indicated references. 

The Suzaku lag-energy spectrum is shown in Extended Data Fig. 4. The lag 
increases up to the same at the same energy as in the XMM-Newton observation 
(8keV rest frame) and then drops again at higher energies. We note that the iron K 
lag amplitude that we measure is roughly the same size as the time bin of the light 
curve. The lag amplitude that we measure is an average time lag, and therefore the 
observed lag amplitude is not directly set by the bin size. The only clear difference 
between the Suzaku lags and those found in XMM-Newton are from 1-2 keV. The 
1-2keV bin in the Suzaku data are 110+ 50s less than the XMM-Newton lags. 
We hesitate to over-interpret one bin, but note that differences in the soft excess 
are also seen between Seyfert galaxies that show similar iron K lag features’®. 

This analysis shows that we detect iron K reverberation in a TDE in two obser- 

vations taken at different times, made with different observatories and analysed 
using different techniques. 
Significance of the lag. In this section we present several statistical tests to 
confirm the detection. We start by testing whether the observed lags are due 
to random fluctuations in the data. Next we test whether the observed lags are 
consistent with the behaviour of the iron K lags seen in other Seyfert galaxies. 
Once we determine the significance of the detection of the iron K lag, we test 
whether this line profile is broad and asymmetric, as these features have important 
implications for the interpretation of the iron K emission originating from close 
to the central black hole. 

We start by testing the detection of the lags using Monte Carlo simulations. We 
simulate light curves using the method described in ref. 36. We obtain our under- 
lying power spectral density model (PSD) by fitting the 0.3-10keV observed PSD 
with a broken power law (plus a constant for the white noise). With this relatively 
short observation, we do not observe a break in the PSD and so we assume a break 
frequency of 10-*Hz to a low-frequency power law with index 1 (consistent with 
previous results*” on the low-frequency variability of Swift J1644+57). Above 
10° *Hz we find a high-frequency power-law index of 2.1. 

We simulate 10,000 light curves in each energy bin, scaled to the observed 
variance, count rate and exposure for that bin. We compute the time lag between 
each bin and the large reference band and assume zero intrinsic lag. Extended 
Data Fig. 5 shows the 1a distribution of the lags in each energy bin overlaid with 
the observed XMM-Newton lag-energy spectrum to check the probability that the 
observed 0.3-10 keV lag—energy spectrum could be due to random fluctuations. 
Summing x? = (x — u)2/o? for each independent energy bin (where x is the 
observed lag, ju is the mean of our zero-lag null hypothesis and a is the standard 
deviation based on the Monte Carlo simulations), we obtain \/d.o.f. = 33/17. 
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Our simulations therefore rule out a zero-lag null hypothesis with >99% 
confidence. This statistical test does not, however, account for the fact that the 
lags are coherently distributed above and below the zero-lag null hypothesis 
and not simply randomly distributed. The distribution of lags is Gaussian for all 
energy bins. 

Next we test whether the 3-10 keV lags that we observe are associated with iron 
K emission lagging behind the continuum. Most sources with iron K lags show 
the smallest lag at the red wing of the iron line, a gradual increase in the lags up 
to the line centroid and a sharp drop above the line centroid back to small lags!”. 
Phenomenologically, this behaviour is well described by a power-law continuum 
plus an asymmetric and broad emission line peaking at the line centroid. We use 
the diskline model as a phenomenological model to test for the presence of an 
iron K lag similar to those seen in Seyfert galaxies, however, we emphasize that 
the diskline (which describes the line broadening from a rotating, inclined, thin 
accretion disk) is formally not an appropriate model to describe emission from 
a face-on thick accretion flow. For the purposes of this simple statistical test for 
the presence of an iron K line in the lag—energy spectrum, the diskline model 
is simply employed in its role as a ‘generic’ skewed line profile and is sufficient. 

To test for the presence of the iron K lag in Swift J1644+-57, we compare two 
nested models in the 3-10 keV band: one of only a constant fitted to the 3-10 keV 
lag-energy spectrum and other containing an additional diskline model. For the 
diskline model we leave the line energy and inclination free (to allow some freedom 
in the width of the line profile) and freeze the inner and outer radii to 6rg and 
200r,, respectively (where rg = GM/c’ is the gravitational radius), with a power-law 
disk emissivity. We use an F-test to compare two nested models. Because the nor- 
malization of the lag can be negative, we avoid the problem of fitting too close to 
the parameter space boundary”. The resulting F-statistic value is Ax?/y? = 23.8 
and therefore the constant plus diskline model is preferred with >99.9% confi- 
dence. If we assume that the iron K lag found in Suzaku data has the same diskline 
energy and inclination as the XMM-Newton lag, then the F test indicates that 
the constant plus diskline model is preferred to >99.8% confidence. The same 
test with a Gaussian rather than a diskline model results in the Gaussian being 
preferred at >99.2% confidence for the XMM-Newton data and 98.8% confidence 
for the Suzaku data. This suggests that an asymmetric diskline model is preferred 
over the Gaussian, though this needs to be tested rigorously (see below). 

Now that we have tested for the presence of an iron K lag, similar to those seen 

in Seyfert galaxies, we test whether the iron line is broad and whether the iron 
line is asymmetric. For the first test, we simply use the function steppar in XSPEC, 
which steps through the parameter space to calculate Ay? at each step. We find 
that the Gaussian is inconsistent with a narrow line at >99% confidence. Next we 
compare the best-fit broad Gaussian model with the asymmetrically broadened 
diskline model. Although the diskline model provides a lower x? value for the 
same number of degrees of freedom, both models result in formally acceptable fits 
(that is, result in a reduced \” less than 1). See a comparison of the Gaussian and 
diskline models in the right panels in Extended Data Fig. 5. We follow a Bayesian 
formalism’? to quantify the improvement of the asymmetric diskline. To compare 
the broad Gaussian fit with the asymmetric diskline model, we compute the ratio 
of the evidences—the so-called Bayes’ factor**. The Bayes’ factor is the ratio of 
the posterior probabilities of our two models given the observed data, which we 
determine by stepping through the free parameters of each model and computing 
? at each point. We convert y’ into a probability for each step and sum the prob- 
abilities over the domain (in effect, integrating the probability over our defined 
parameter space). The free parameters and their domains for each model are 
shown in Extended Data Table 2. For simplicity, we freeze the normalization of 
the power law for both models to be the average lag from 3-4 keV and 6-10 keV. 
We find the Bayes’ factor (the ratio of the diskline-integrated probability to the 
Gaussian probability) to be 55. This corresponds to the asymmetric diskline being 
favoured over the Gaussian at >98% confidence. 
Comments on the reverberation lag-mass relation. In Fig. 3, we compared 
the frequency and amplitude of the iron K lag with the black hole mass for sev- 
eral Seyfert galaxies with known reverberation lags. Those sources are shown in 
Extended Data Table 3. We use all published iron K lag results, excluding the iron 
K lag in MS 22549—3712, which is associated with a quasiperiodic oscillation and 
may not have the same variability mechanism as the other sources”®. 

Where possible, we use optical reverberation mass estimates with scale factor 
(f) =4.3 (ref. 39). These were obtained from the AGN (active galactic nuclei) 
Black Hole Mass Database (http://astro.gsu.edu/AGNmass). Most masses were 
calculated using the relation between the broad line region (BLR) radius and the 
luminosity, Rpir« L" (refs 40-42), with a = 0.5. For sources that do not have an 
optical reverberation mass estimate, we assumed an error of 0.5 dex. 

Some of the spread in the correlation between the lag amplitude and black 
hole mass in Seyfert galaxies is probably due to differences in the accretion flow 


geometry and orientation. To this effect, if the geometry of the tidal disruption 
is very different to that of Seyfert galaxies (as we expect), then Swift J1644+-57 
may not lie on this correlation. Therefore, some caution should be exercised in 
using the lag amplitude as a definitive indicator of the mass. We do note however, 
that although the amplitude of the lag is dependent on the geometry of the flow 
(that is, dilution effects, different light travel times, the effect of inclination), the 
frequency at which the lag is found is less dependent on geometry and is therefore 
likely to be a more trustworthy indicator. 

In Extended Data Fig. 6 we compare the reverberation lags of three rapidly var- 

iable Seyfert galaxies and the TDE Swift J1644-++57. The sources are 1H 0707—405, 
IRAS 13224—3809 and Swift J2127.4+5654. To compare the shapes of the 
lag-energy spectra of different mass objects, we have scaled the lags so that they 
have the same iron K lag amplitude. There are clearly similarities in the overall 
spectra (for example, the lag of the soft excess, broad iron K line and Compton 
hump with respect to the continuum), although there are differences in the shapes 
of the line profiles. Swift J644+-57 has a slightly narrower iron K line profile than 
1H 0707—495 and IRAS 13224—3809. The energy of the line centroid in Swift 
J1644+-57 is also slightly higher than the line centroid in Swift J2127.4+5654. We 
discuss possible physical reasons for this in the next sections. 
Theoretical framework of super-Eddington accretion in TDEs. During a TDE 
the black hole goes from being dormant to radiating at or above the Eddington 
luminosity within around one month, and then decays in around a year to sub- 
Eddington levels!*. The dynamical mass fallback rate of the debris is super- 
Eddington when the black hole mass is less than 50 million solar masses!°. If the 
circularization of the debris is efficient enough**“®, then the accretion on the 
black hole is also super-Eddington. 

The small super-Eddington accretion disk in a TDE has not been well studied, 

but very recently numerical simulations have been employed to understand 
super-Eddington disks in persistent sources such as AGN or X-ray binaries***». 
Although the overall radiative efficiency and luminosity are still debated, in all 
simulations the disk structure is geometrically thick and optically thick, deviating 
from the standard thin disk picture*”. A strong optically thick outflow is also 
generated and radiation can only leak through a narrow funnel along the polar 
direction. Close to the black hole, simulations show that a jet can help to carve 
out the inner accretion flow, thus exposing the X-ray emitting region of the disk”. 
In the final section, we present a toy model of the X-ray emission based on the 
geometry suggested by these numerical simulations. 
Iron line reverberation from the outflowing eye-wall of a super-Eddington 
accretion flow. XMM-Newton shows that the iron line peaks at 7.5-8 keV in 
both the direct energy spectrum and in the lag—energy spectrum. Even in the 
case of hydrogen-like iron (with an iron-Ka line energy of 6.97 keV) the observed 
energies necessitate a significant blueshift. In the case of sub-Eddington AGN that 
possess geometrically thin accretion disks such blueshifts would suggest a high- 
inclination accretion disk. In the case of Swift J1644+57, however, multiple lines 
of evidence*? point to a low-inclination (pole-on) geometry. Given the robust 
theoretical expectation (discussed in the previous section) that the accretion disk 
of such a super-Eddington source should be very geometrically thick, we suggest 
that the blueshifts seen in the iron line correspond to (radiation-driven) outflows 
in the funnel, or ‘eye wall; of a geometrically thick accretion flow. In addition to 
the blue wing, the strong red wing of the iron line necessitates gravitational red- 
shift (neglecting the small effect of multiple scattering events that can Compton 
downscatter some iron K photons to lower energies). This therefore suggests that 
the base of the geometrically thick funnel reaches deep into the potential well of 
the black hole, within 10 gravitational radii. 

As proof of principle we construct a simple toy model of iron line reverbera- 
tion in a funnel geometry (see Fig. 4). The funnel is assumed to be conical with a 
half-opening angle of 6, and we consider X-ray line reverberation driven by some 
variable X-ray source on-axis in the funnel at distance Rx from the black hole. 
We suppose that line emission can be excited along the walls of the funnel from 
radii Rin (within which we assume the accretion flow to be completely optically 
thick) to Rmax (beyond which the disk starts to flatten off and cannot see the X-ray 
source). In this toy model, we neglect the rotation of the flow and simply prescribe 
a radial outflow velocity profile for the wall material: 


R— Ruin 
R+ Race 


v(R) = vo + (%— Vo) 


where v9 is an initial velocity, v; is a terminal velocity, and Racc is the characteristic 
radius over which the acceleration proceeds. We then calculate the line profile in 
both the flux—-energy and lag—energy spectra, assuming that we are viewing the 
funnel exactly pole-on. We include the effects of the relativistic Doppler shifts, 
relativistic beaming and gravitational redshift on both the excitation of the line 
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emission by the source and the observed energy/lags. We do not, however, account 
for general relativistic light bending. 

Example results for a range of terminal velocities are shown in Extended Data 
Fig. 7 for the following illustrative parameters: 0 = 11/4, Rmin = 61g, Rmax = 200rg, 
Rx =30rg, vo = 0. To calculate time-lags in physical units, we also need to assume 
a black hole mass: we take M=2 x 10°Mo. Strongly blueshifted line peaks can 
readily be obtained—with this parameter set, we find that a terminal velocity of 
vy, 0.5c is required to have the line peak at 7.8 keV. We also obtain lag-energy 
spectra comparable to those seen in the data, peaking at 7.8 keV with a tail to lower 
energies. We obtain qualitatively similar results if we have a narrower opening 
angle (that is, @= 1/6) and slower terminal velocity (v;~ 0.3c). Because of this 
degeneracy, we do not make a strong claim of the funnel opening angle. 

An important technical point to note is that the time-lags derived in a rever- 

beration analysis are smaller than the actual time-lags (that is, the light travel time 
of the extra path-length traversed by the reflected emission) due to dilution of the 
line-band by the coherent component of the direct continuum emission. In the toy 
model results shown here, we derive the observed (diluted) lags by multiplying 
the actual lags by the fraction that the line emission contributes at each energy, 
normalizing so that the peak of the line is 30% of its local continuum (motivated 
by the observed total energy spectrum, Fig. 2). It remains a distinct possibility that 
some fraction of the observed X-ray continuum is incoherent, meaning that we 
have overcorrected for dilution and hence underpredicted the dilution-corrected 
lags. A full treatment of dilution in any theoretical model requires a consideration 
of the full reflection spectrum and a Fourier-domain treatment that parallels that 
applied to the real data. 
Code availability. The model fitting of spectra and lag-energy spectra was com- 
pleted with XSPEC, which is available at the HEASARC website (http://heasarc. 
gsfc.nasa.gov). The code used for the time lag analysis and Monte Carlo simu- 
lations is not currently available. All figures were made in Veusz (http://home. 
gna.org/veusz). 
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Extended Data Figure 1 | XMM-Newton lag-frequency spectrum. 

The lag—-frequency spectrum between rest frame 0.3-1 keV and 1-4keV 
(the usual soft and hard bands chosen for reverberation studies””!). The 
lag—frequency spectrum shows a negative lag (that is, the soft band lagging 
behind the hard band) from (2-10) x 10-4 Hz (highlighted by the red 
hashed region). Because the lag-frequency spectrum shows a soft lag at 
these frequencies, we examined further the lag-energy spectrum at this 
particular frequency range (see Fig. 1). Error bars are at the Io level. 
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Extended Data Figure 2 | XMM-Newton light curve of Swift J1644+57. 
The observed 0.3-10 keV light curve in 200s bins showing the rapid 
variability in this 25 ks observation. Note that the y axis does not extend 
to zero count rate. Error bars are at the 1a level. Inset, the 4-6 keV 
(~5.5-8 keV rest frame) light curve (red) and the 6-10 keV (~8-13 keV 
rest frame) light curve (blue) zoomed in on the most variable part of the 
light curve from 5,000 s to 15,000 s (indicated by the red line 


Time (s) 


in the main figure). The y axis is in units of count rate divided by the 
mean of the entire light curve. This illustrates (in the time domain) that 
on average the 6-10 keV variability leads the 4-6 keV variability. We 
emphasize that the time delay seen in these coarsely binned light curves 
places an upper limit on the actual amplitude of the lag. The detailed 
Fourier analysis (Fig. 1) allows us to obtain a better estimate of the average 
time lag. 
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Extended Data Figure 3 | Suzaku XIS and HXD spectra. A ratio plot of 
the time-integrated energy spectra of the front-illuminated (black) and 
back-illuminated (red) XIS detectors and the HXD (green) to an absorbed 
power-law fit from 3-6 keV. The blue line at a ratio of one indicates where 
the data would be fully described by the absorbed power-law model. Error 
bars are at the 1a level. This fit exhibits the potential hardening in the 
spectrum above ~6 keV that appears to continue up to ~30 keV, beyond 
which the spectrum turns over. There is no obvious iron line in the XIS 
spectra. 
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Extended Data Figure 4 | Suzaku lag-energy spectrum. The lag—energy 
spectrum of the Suzaku observation taken 10 days before the XMM- 
Newton observation. An iron K lag at the same frequency and same energy 
as the XMM-Newton iron K lag is detected in the data. This confirms the 
presence of an iron K lag with an amplitude of ~120 s in two separate 
observations, taken with different instruments and analysed using 
different techniques. Error bars are at the 1c level. 
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Extended Data Figure 5 | Statistical significance of the XMM-Newton 
lag-energy spectrum. a, The observed lag-energy spectrum (black points; 
same as Fig. 1 except the zero-point lag has not been shifted) compared 
with the lo distribution of 10,000 simulated Monte Carlo light curve pairs 
in each energy bin with zero lag (blue shaded region). This plot shows 


Observed energy (keV) 


that the observed 0.3-10 keV lag energy is inconsistent with zero lag at 
>99% confidence. See Methods for details of the simulations. b, c, The 
best Gaussian (b) and asymmetric diskline (c) model fits (red lines) to the 
3-10 keV lag—-energy spectrum. We compute the Bayes’ factor between 
the models and find that the diskline is preferred at >98% confidence. 
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Extended Data Figure 6 | Swift J1644+57 lag-energy spectrum lag units that have been scaled so that all of the sources have the same 
compared with Seyfert galaxies. Overlay of the reverberation signatures iron K lag amplitude. We show the scaled lag because all of the sources 
in nearby variable Seyfert galaxies compared with Swift J1644+57 have different black hole masses (see Fig. 3). There is similarity in the 
(black diamonds). We show IRAS 13224—3809 in green, 1H 0707—495 in broadband shape, although the iron K line profile of Swift J1644+57 
blue and Swift J2127.4+5654 in purple. The y axis shows arbitrary time appears to be slightly narrower and peaks at higher energies. 
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Extended Data Figure 7 | Flux-energy and lag-energy spectra of reflection in super-Eddington flow. Flux-energy (left) and lag—energy (right) 
spectra derived from our toy model for iron xxv1 (rest-frame 6.97 keV) Ka line reverberation from an outflowing funnel wall. The results for terminal 
velocities of 0, 0.1c, 0.2c, 0.3c, 0.4c, 0.5¢ and 0.6c are shown (from left to right in peak energies). 
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Extended Data Table 1 | Observations used in this analysis 


Instument Date OBSID Exposure 
(YYYY-MM-DD) (s) 

XMM-Newton 2011-04-16 0678380101 25000 

Suzaku 2011-04-06 906001010 52000 


Asummary of the data used in this work, including the name of the telescope, the date of the observation, 
the OBSID unique identifier and the observation exposure length. 
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Extended Data Table 2 | Results of the fits to the lag—energy spectrum 


Gaussian Model Diskline Model 
Parameter Best-fit Domain | Parameter Best-fit Domain 
Line energy (keV) 5.3 [3, 10] Line energy (keV) 5.3 [3, 10] 
Width (keV) 0.67 [0, 3] Inclination (deg) 42 [0, 90] 
Normalization (s) 188 [0, 500] | Normalization (s) 167 [0, 500] 


Bayes’ Factor (Paiskiine/PGaussian): 55 
Probability: > 98% 


The free parameters in the Gaussian and diskline models. We show the best-fit parameters, the domain over which we step through the 
parameters, the corresponding Bayes’ factor and the probability. Note that for fitting in XSPEC we have shifted the lag amplitudes up by a 
constant value of 1,000 s so that none of the normalizations would be negative. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Extended Data Table 3 | Lags and masses of Seyfert galaxies with reverberation detections 


Name Lag Amplitude Lag Frequency Mass 
(s) (10-4 Hz) (log(M/Mo)) 

NGC 4151 [10,48] 8801360 0.1-0.5 7.55+0.05 
1H 0707-495 [49,50] 47+16 10-30 6.4+0.5 
IRAS 13224-3809 [51,52] 299+135 1.4-2.8 6.6+0.5 
MCG-5-23-16 [18,52] 1037+455 0.1-10 7.9+0.5 
NGC 7314 [18,52] 77430 0.1-4 6.9+0.5 
Ark 564 [19,53] 92+65 3.2-5.2 6.4+0.5 
Mrk 335 [19,54] 193498 1.9-6.7 7.23+0.04 
PG 1244+026 [55,56] 762+306 0.9-3.6 7.3+0.5 
Swift J2127.4+5654 [57,58] 4074127 0.4-4.5 7.2+0.5 
NGC 1365 [35,59] 500+120 0.9-1.9 7.740.5 
NGC 4051 [60,61] 90+30 3.5-40 6.1+0.1 


The observed iron K lag amplitude (measured between 3-4 keV and 5-7 keV) with lo errors, observed iron K lag frequency range and black hole 
mass with lo errors are shown for all Seyfert galaxies with known iron K reverberation detections. The two references listed in brackets after each 
object correspond to the reference!®1819,35.49,51,55,57,60 for the iron K lag amplitude and frequency, and the reference#8:50.52-54.56,58,59.61 for the 
black hole mass. These data correspond to the plot shown in Fig. 3 (although the error on the mass has not been plotted, for clarity). 
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Origin and implications of non-radial Imbrium 


Sculpture on the Moon 


Peter H. Schultz! & David A. Crawford? 


Rimmed grooves, lineations and elongate craters around Mare 
Imbrium shape much of the nearside Moon. This pattern was coined 
the Imbrium Sculpture’, and it was originally argued that it must 
have been formed by a giant oblique (~30°) impact, a conclusion 
echoed by later studies”. Some investigators, however, noticed 
that many elements of the Imbrium Sculpture are not radial to 
Imbrium, thereby implicating an endogenic or structural origin**. 
Here we use these non-radial trends to conclude that the Imbrium 
impactor was a proto-planet (half the diameter of Vesta), once part 
of a population of large proto-planets in the asteroid belt. Such 
independent constraints on the sizes of the Imbrium and other 
basin-forming impactors markedly increase estimates for the mass 
in the asteroid belt before depletion caused by the orbital migration 
of Jupiter and Saturn®. Moreover, laboratory impact experiments, 
shock physics codes and the groove widths indicate that multiple 
fragments (up to 2% of the initial diameter) from each oblique 
basin-forming impactor, such as the one that formed Imbrium, 
should have survived planetary collisions and contributed to the 
heavy impact bombardment between 4.3 and 3.8 billion years ago. 

Previous interpretations for the Imbrium Sculpture included expres- 
sions of both structural features and secondary ejecta®”. Horsts and 
graben crossing the Imbrium basin rim (the Apennine Mountains)° and 
subsurface radial dikes detected from Gravity Recovery and Interior 
Laboratory (GRAIL) data® support this interpretation in certain loca- 
tions. Most of the features comprising the facies surrounding Imbrium, 
however, relate to ballistic emplacement”"”, Grooves and secondary 
craters with breached down-range rims record the momentum of ballistic 
debris but many do not radiate from the centre of the basin. Moreover, 
grooves that radiate from near the basin centre superpose those 
non-radial grooves and lineations originating well up range (Fig. 1a), 
in contrast to the traditional single-stage sequence of ejecta emplace- 
ment where the ejecta curtain advances outward systematically’®. 
Rimmed grooves (sculptured terrains) characterize first-arriving 
debris, whereas lower-relief radial lineations (mobilized ejecta deposits) 
and elongated secondaries generally characterize late-arriving ejecta. 
Previous studies combined both facies into the single term Imbrium 
Sculpture, but differences in expression, direction and scale suggest that 
part of the sculpturing component (as originally described in Earth- 
based telescopic observations) may have a different cause. 

The distribution of basin and ejecta asymmetries (see Extended 
Data Figs 1-4) indicates a northwest-southeast Imbrium-impactor 
trajectory!”. Mapped endpoints of each groove/secondary (lineation) 
defined great circles that were traced back until they intersected the 
great circle delineating the impactor trajectory. The selected northwest- 
southeast trajectory was then readjusted until all great circles from 
Imbrium-related ejecta converged within the basin. Sets of these great 
circles were then subdivided according to where they crossed the read- 
justed trajectory line: up range from the basin centre, down range from 
the centre, and within a radius of 300 km around the centre. Laboratory 
and hydrocode impact experiments previously documented evolving 


source regions where the ejecta launch positions migrated within the 
growing excavation crater (see Extended Data Fig. 5)'!"!°. The pattern 
becomes most evident when the impactor-diameter to crater-diameter 
ratio becomes large, for example, very large scales, oblique trajectories, 
strength-controlled targets or low-speed impacts. The pattern far to the 
east resembles one side of a butterfly pattern (the western counterpart 
masked by Oceanus Procellarum and Orientale ejecta) that reaches 
the lunar far side, about 2,700 km (90°) east of Mare Marginis. One 
set of lineations, however, never converges within the basin rim and 
parallels the optimized trajectory (Fig. 1b). We propose that this set is 
an expression of the failed impactor. Maps of the different sets of groove 
and lineation trends are included in Extended Data Figs 2-4. 

Hypervelocity impact experiments'»!? and numerical models'? 
reveal sets of ejecta trajectories converging well up range of the final 
crater centre, just as is observed for Imbrium. In impact experiments 
(~5kms7! at 15°) with solid targets, rays and surface scouring down 
range subtend an angle to either side of the trajectory comparable to the 
impact angle (Fig. 2b). The outermost set converges up range of the final 
crater rim and relates to the sheared-off portions of the impactor!*"*. 
Nearly parallel scours, however, originate from either side of the 
projectile (Extended Data Fig. 6). For large-body impacts, surface 
curvature becomes important, and portions of the projectile escape 
re-impacting the surface entirely (Fig. 2b). Captured fragments, crater 
sizes in down-range witness plates, and in-flight imaging in laboratory 
experiments (see also Extended Data Fig. 7) reveal that 5-8 large, 
intact and unmelted fragments survive, which is indicative of a non- 
catastrophic failure process’*. Collectively, the larger fragments com- 
prise about half the initial mass, each of which represents 40-46% of 
the original impactor diameter. 

This observed physical process in the laboratory should apply at much 
larger scales. A numerical simulation used the CTH shock-physics 
analysis package for a 100km monolithic dunite asteroid hitting the 
Moon at an impact velocity of 10kms~! at 30° from horizontal and 
incorporating a localized shear-failure material model (see Methods). 
Just as in experiments, projectile components scour the surface down 
range well before crater excavation and ejecta emplacement; moreover, 
their trajectories can be traced back to the projectile at the region of 
first contact, with some sets that extend back to the sides of the original 
impactor (Fig. 3a). This simulation closely matches the orientation of 
mapped grooves around the Moscoviense basin on the Moon (Fig. 3b 
and Extended Data Fig. 8b). 

Consequently, grooves and lineations comprising the Imbrium 
Sculpture reveal several important clues about the excavation 
process. First, the initial stage of excavation originated well up range of 
the basin centre, consistent with experiments and numerical models. 
Debris ejected at this stage has very high speeds and low angles directed 
generally down range’”!”. Second, sheared-off (decapitated) portions 
of the original asteroid retain speeds close to its original impact speed 
and extend far down range due to surface curvature'>-!*. Proposed 
remnants include heavily degraded elongate craters and grooves 
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Sandia National Laboratories, Albuquerque, New Mexico 87185, USA. 


21 JULY 2016 | VOL 535 | NATURE | 391 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Figure 1 | Views and trends of Imbrium Sculpture. a, Imbrium Sculpture 
from south of the 24-km-diameter crater Lalande in the centre of view 
(looking to the northwest) with the crater Mésting at the top right. Dashed 
white lines trace secondaries originating from near the basin centre cross 
or truncate grooves originating well up range from the centre of Imbrium 
(solid lines). Triangle identifies the Surveyor 6 landing site in Sinus Medii 
(top right). Apollo 16 metric frame (A16-M-1413). b, Convergence of 

all grooves and highly elongated secondary craters on an Imbrium- 
centred map in a Lambert-stereographic conformal projection centred on 
Imbrium (offset from centre here). Yellow triangle locates the Apollo 15 
landing site just inside the southeast Imbrium rim (white dot). Green lines 


beyond the southern rim of Imbrium (for example, Boscovich). 
Last, ejecta during later crater excavation stage (created by shock 
rarefactions off the free surface) become more radial to the centre 


a b 


correspond to subsets that converge along a great circle northwest of the 
basin centre. Red lines correspond to elongate secondaries converging up 
range of the basin centre, but a notable set of near-parallel trends up range 
(green) never converges within the basin. Black line indicates the proposed 
trajectory of the Imbrium impact (from the northwest); white dot indicates 
the basin centre. Image is a re-projected composite from the wide-angle 
Lunar Reconnaissance Orbiter Camera (LROG; circular region at top is 

an artefact around the North Pole). Sample size was determined by the 
ability to discern a clear orientation (based on length and expression) 

of each groove or elongated secondary crater, yielding over 230 mapped 
orientations (total set shown in Extended Fig. 3a). 


of the final basin and superpose scouring from the earliest stages. 
Whether caused by ballistic ejecta from low-speed impacts or grazing 
impactor fragments, the width of each groove or crater should be 


Figure 2 | An oblique impact experiment into a planar aluminium and 
cylindrical target by aluminium projectile impacting at 5.2kms~!. 

a, Scours originating from the point of first contact converge up range 
and result from projectile fragments re-impacting the surface downrange 
(impact angle was at 15° from the horizontal). b, An oblique impact into 
a curved surface prevents fragments from re-impacting the surface, as 

in planar targets. Fragments (arrows) surviving a5 kms! impact into 

a 6.7-cm aluminium disk (faintly outlined) by a 0.635-cm aluminium 


392 | NATURE | VOL 535 | 21 JULY 2016 


sphere (represented graphically). About 401s after impact (top), down- 
range fragmental debris (just above the yellow dotted line) separates from 
the vapour plume (blue) along the initial trajectory. The bottom image 
shows the evolution 151s later with 8-10 large fragments (1-3 mm across) 
travelling close to the same speed as the initial impactor. Additional 
images can be found in Extended Data Fig. 7. The experiments shown 
here have been replicated numerous (more than 30) times for both similar 
and different conditions of impactor speed and angle. 
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Figure 3 | Hydrocode simulation tracing impactor fragments from a 
basin-forming impact compared with scours from the Moscoviense 
basin. a, Hydrocode simulation of a 100 km asteroid impacting the Moon 
at 30° (dunite at 10kms~') resulting in the same non-radial pattern 
observed in hypervelocity impact experiments. Red coloration is the 
pattern of asteroid relics interacting with the target surface. b, Mapped 
scours (degraded) on the Moscoviense basin for comparison with the 
hydrocode model. Images without overlays are included in the Extended 
Data Fig. 8. c, Histograms generated from intersections between great 
circles extending a subset of grooves (not converging within Imbrium) and 
great circles orthogonal to the Imbrium trajectory at different distances 


controlled by the target strength and only a few times the size of the 
impacting debris (see Methods). 

Such observations indicate that debris scoured the surface down 
range before reaching the final transient cavity dimension, well before 
the arrival of ejecta from the excavation stage. Expressions of pre-ejecta 
scouring include breaches in the southeast Imbrium rim within the 
Apennines and the arcuate region southeast of Imbrium (extending 
from craters Julius Caesar to near Flamarian). Similarly, the extension 
of the inner ring and degraded scour zones crossing the outer ring 
in Moscoviense illustrate this process (Fig. 3b), in contrast with the 
up-range rim)”, 

The evolving flow field from an oblique impact accounts for the 
different groove orientations and enigmatic sequence of the Imbrium 
Sculpture. On the basis of experiments and models, certain sets of 
Sculpture probably originated from surviving impactor fragments and 
can be used to constrain the size of the Imbrium projectile, provided the 
Imbrium impact point can be determined. The farthest up-range loca- 
tion of the impact point must be well within the outermost Imbrium 
ring to the north (outer edge of Mare Frigoris). Groove trends up 
range intersecting the Imbrium trajectory line, however, constrain the 
farthest down-range location for the impact point, close to Montes Jura 
(Fig. 1b). 

To provide a quantitative estimate for the Imbrium impactor size, 
great circle segments were placed orthogonal to the trajectory at 
incremental distances up range from the basin centre, as illustrated in 
laboratory experiments (Extended Data Figs 2-4). Intercepts between 
the extended great circles and this series of orthogonal lines gener- 
ate a histogram as a function of distance from the basin centre (Fig. 
3c). Those trends never crossing the original Imbrium trajectory 
originated from the sides of the impactor that detached soon after the 
moment of impact. On the basis of this strategy, the diameter of the 
Imbrium asteroid is estimated to have been 275km, with statistical 
(and measurement) uncertainty constraining the outermost limit. If the 


LETTER 


Imbrium 


Wl | 
Wl aN 


uot 


is 8 8 
2 8 7 
rom basin trajectory 


g oT 
pistance f 


from the basin centre (out to 1,200 km). Top view shows all sets between 
400 km and 1,200 km from the basin centre, with down range to the 

top. The limits of intersections correspond to the sides of the projectile 
(A-A and B-B in the insets). Bottom view selects just those intersections 
between 500 km and 700 km from the basin centre where the impact point 
occurred. The diameter of the Imbrium impactor is estimated to have 
been between 220 km and 275 km in diameter. See also Extended Data 

Fig. 2d. Intersection points composing the histograms have observational 
errors that depend on the length and distance of the measured grooves and 
scours. Repeat measurements indicate errors no more than 30 km. 


shoulder of the distribution is used, then the derived diameter reduces 
to 220km. The best determination between these two values becomes 
~250km + 25 km. With the same approach, preliminary data indicate 
that the impactor diameters for the Schrédinger, Moscoviense and 
Orientale basins were ~45 km, ~100 km and ~110km, respectively”, 

The derived impactor diameters can be used to calculate the diameter 
of the apparent transient crater (see Methods). The apparent transient 
diameter here refers to the diameter measured at the pre-impact sur- 
face before collapse. For a non-porous, gravity-controlled scaling”, 
the Imbrium body (250km in diameter) impacting at 30° (25km s!) 
would result in a pre-collapse apparent transient diameter of ~850 km 
(compared to the observed diameter of ~1,250km) and closely 
corresponds to the next most inner ring, from Montes Alpes to inside 
the Archimedes plateau. Decreasing the impact speed to 15kms7! 
decreases the transient crater sizes to ~700 km. 

This reinterpretation of some components of the Imbrium Sculpture 
leads to other insights including impact speed, impactor signatures 
and sizes of proto-planets in the early Solar System. First, the derived 
sizes for the Orientale and Moscoviense impactors are nearly the same 
size, yet their basin diameters are very different (930km and 445 km). 
Since both were formed by oblique impacts, this difference in crater 
size now requires a low impact speed for Moscoviense, consistent with 
the hydrocode model (Fig. 3b). 

Second, impactor components surviving impact provide an expla- 
nation for meteoritic signatures within certain basins and high- 
land lunar samples (see Methods). A previous study suggested that 
portions of the impactor could survive low-speed vertical impacts 
(based on a two-dimensional model)”°. Here we find that the three- 
dimensional hydrocode model indicates that impactor components 
should line the basin interior but were buried by late-arriving ejecta 
beyond the rim. Down-range dispersal of the Imbrium impactor 
also could account for similar meteoritic compositions in Apollo 
16 regolith breccias”!. 
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Last, Imbrium is among the largest among all lunar basins created 
during the Late Heavy Bombardment (LHB) between 4.1 and 3.7 billion 
years ago. On the basis of our independent estimate for its diameter, 
the Imbrium impactor would have been ~2.5 x 10 g (for an aster- 
oid with a bulk density of 2.5 gcc~'). Consequently, the total mass 
intersecting just the Moon during the LHB significantly exceeded the 
10?! ¢ previously proposed. But the Imbrium impactor was only one 
of many proto-planets in the proposed E-belt, which was destabilized 
by late giant planet migration””. Large-body oblique impacts on both 
the terrestrial planets and asteroids should have generated large debris 
injected into planet-crossing orbits due to surface curvature, for exam- 
ple, Rheasilvia on Vesta, Hellas on Mars, Crisium, and even the earlier 
South Pole-Aitken basin'* on the Moon. Fragments as large as 2% 
of the original diameter survived these collisions (see Methods) and 
generated numerous multi-kilometre-size objects in planet-crossing 
orbits that would have led to numerous craters 20-50 km in diameter. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Imbrium grooves and elongated secondary craters. The Imbrium Sculpture 
reflects a sequence of arrival and source regions from Imbrium, even in the same 
region of the Moon, as illustrated by several examples. For reference, Extended 
Data Fig. 1 identifies the concentric rings of Imbrium defined by relic mountains 
and wrinkle ridges. The northern part of the outer ring (corresponding to Mare 
Frigoris) is poorly expressed due to collapse of the up-range rim. Extended Data 
Figure 2a includes a region southeast of Imbrium, close to the proposed trajectory. 
Farther west in this region, trends of some grooves diverge, trending more to the 
northwest. Extensions of these trends converge (and indicate a source) up range of 
the basin centre, characteristic of oblique impacts. This contrast in source regions 
becomes more apparent farther southwest (Extended Data Fig. 2b) and southeast 
of Imbrium (Extended Data Fig. 2c). Flow lines directed more radial to the basin 
centre overlap those secondaries and grooves originating up range. Up range from 
Imbrium (Extended Data Fig. 2d), grooves are narrower and closer to the basin 
than in other directions and also exhibit different source regions within the basin. 

Expression of the initial trajectory for each set varies with impact angle. At 

higher impact angles (>45°), returning ejecta create slightly elongate craters. At 
lower angles (<30°), they produce elongate secondary craters with breached down- 
range rims, as down-range-directed debris interferes with excavation. At very low 
impact angles (10°), however, speeds need to increase in order to achieve a given 
ballistic range, thereby scouring the surface and producing rimmed grooves and 
scours that retain signatures of their initial momentum. The long grooves and 
elongated secondaries comprising the Imbrium Sculpture indicate that the impact 
angles must have been at very low angles (<10°). 
Complete distribution of grooves and elongated secondary craters. The entire 
set of mapped grooves and elongated secondary craters with extensions along great 
circles towards Imbrium (Extended Data Fig. 3a). Some secondaries northeast of 
Mare Marginis (lateral) originate from a region up range of the basin centre. No 
statistical methods were used to predetermine the sample size. Many trends that 
converge up range from the basin centre are shown in Extended Data Fig. 3b. 
Source regions for a subset of grooves and elongate secondaries found up range 
of Imbrium cross the trajectory line well up range of the basin centre (illustrated 
in Fig. 1b and Extended Data Fig. 3c) and constrain the region of first contact by 
the Imbrium impactor to a region northwest of Sinus Iridum in what is now Mare 
Frigoris. Most trends, however, converge within 300 km of the centre of Imbrium 
(Extended Data Fig. 3d). The extended ballistic ranges to the east-northeast give 
the impression of a butterfly pattern, characteristic of an oblique impact, whereas 
the paucity of examples west and southwest reflect burial by later volcanic units 
of Oceanus Procellarum. 

Some mapped sets indicate a source region down range of the basin centre as 
the result of a migrating ejecta flow centre. Such flow migration characterizes 
the early stages of oblique impacts and becomes more evident at very large scales 
as cratering efficiency reduces due to the role of gravity limiting crater growth. 
A subset of trends, however, never crosses the trajectory line or converges beyond 
the up-range rim of Imbrium (Extended Data Fig. 4). These sets are interpreted as 
grooves and secondary craters created by decapitated fragments from the Imbrium 
impactor and are used to constrain estimates for its size. Those great circles deline- 
ating the lateral limit represent fragments from the sides of the impactor, whereas 
those close to the trajectory are from the top. We suggest that a portion of Rheita 
Vallis is actually associated with Imbrium, rather than Nectaris. A fraction of the 
initial kinetic energy is lost during large-body oblique impacts. In laboratory 
experiments’ (at 5kms ! to 6kms'), as much as 22% at 15° and 10% at 30° of the 
initial kinetic energy decouples from crater formation in planar impacts. Curved 
surfaces allow impactor debris to extend farther beyond the rim (or lost to space, if 
not re-accreted) before impacting the surface (shown in Fig. 2b). Such decoupling 
results in dramatic reductions in crater depth, in evidence from both laboratory 
experiments and numerical codes". 

Late-stage ejecta typically bury or mask earlier stages originating up range. 
Ejecta from Imbrium, however, are not simply deposited but are mobilized after 
striking the surface due to their initial momentum, thereby resulting in exten- 
sive mixing)”, Hence, deposit thicknesses are difficult to measure, if not predict. 
Mapping of the different sets directly down range (near Julius Caesar; Extended 
Data Fig. 2c) illustrates such burial or extensive modification. In other directions 
(Extended Data Fig. 2a), ejecta deposits may stagnate in lows, which exposes cross- 
ing patterns at higher elevations (for example, Sinus Aestuum, near the horizon in 
Fig. 1a). At greater distances from Imbrium, thinning of the ejecta deposits allows 
recognition of the different orientations more clearly (Extended Data Fig. 2b). 
Strategy for determining impact first contact and impactor size. Laboratory 
impact experiments capture ejecta in ballistic flight through a pulsed laser sheet". 
Extended Data Figure 5 illustrates the evolution of ejecta trajectories as they evolve 
during crater growth and clearly reveals that the source region for ejecta evolves 
from a position near the impact point (Extended Data Fig. 5a, red dot) to a position 
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close to the centre of the final crater (Extended Data Fig. 5b). In laboratory-scale 
experiments using loose particulates, the size of the region of coupling between the 
projectile and target is small relative to the final crater, best expressed at early stages 
of formation!”, Specifically, the ratio between crater diameter and projectile 
diameter ranges from 20:1 to 50:1, depending on impact angle and impact speed. 
For strength-controlled targets (Fig. 2a), however, this ratio ranges from 4 to 8, also 
depending on angle and speed. This ratio reduces as a function of increasing size 
for gravity-controlled growth’? and approaches that for a strength-controlled crater 
at basin scales. Hence, the evolving source region establishes a strategy not only 
for determining the point of impact but also constrains the size of the projectile, 
provided that the crater-to-projectile diameter ratio is small as illustrated in Fig. 2. 

The point of first contact is constrained by the convergence of up-range ejecta, 
whereas the size of the projectile can be determined from the convergence of 
the down-range ejecta (Extended Data Fig. 6a). Such pronounced asymmetries 
disappear through time in laboratory experiments using gravity-controlled 
particulate targets (Extended Data Fig. 5b). Trends established by scouring extend 
up range, where they cross two orthogonal lines placed at different distances from 
the up-range rim for both a 15° and 30° impact (Extended Data Fig. 6b). Crossing 
the orthogonal line at the up-range crater rim provides a reasonable prediction for 
the projectile diameter. In these images, the slight offset from the trajectory line 
is the result of the spin of the projectile due to rifling in the launch tube. This spin 
imparts a centrifugal force to sheared and spalled fragments as demonstrated by 
reversed offsets between left- and right-hand twist tubes. 

A subset of trends of impacting ejecta (grooves and elongate craters) northwest 

of Imbrium (up range) converges within the Atlas Mountains, but none originates 
northward of Mare Frigoris (Extended Data Fig. 3c). Consequently, the region 
of first contact must be confined to this region (Extended Data Fig. 3a). Those 
great-circle trends determined from inferred down-range ejecta trajectories, 
extending back to the region of first contact, then can be used to constrain an 
estimate for the lateral dimension of the object that formed this basin, independent 
of any a priori information about the impact angle or speed (only the trajectory 
needs to be assumed or determined), Results of the CTH moel of the Moscoviense 
impact further justify this strategy (Fig. 2b). 
CTH model. The average impact speed by asteroids at the Moon is ~23kms |, 
but for this numerical simulation (Fig. 3a) we assumed a relatively low-speed 
impact to demonstrate that the process does occur at much larger scales (and at 
only slightly higher speeds than in the experiments). The CTH model”? will be 
described more fully in a separate contribution but is summarized eleswhere!. 
This new modification includes a damage model” incorporating the pressure- 
dependent melt curve”». Specifically, the yield strength of fully damaged material 
follows a Coulomb friction law. As conditions approach the melting point, thermal 
softening of the bulk material occurs. The model also included a method to esti- 
mate strain-dependent statistical crack spacing, shear heating within cracks, and 
heat loss via conduction away from cracks. As the temperature within a crack 
approaches the melting temperature, these additions now incorporate thermal 
softening. The combination of these modifications to the code results in a Brittle 
Damage with Localized Thermal Softening (BDL) model. This model depends on 
crack spacing (L), which is assumed to follow a power law in strain rate (€), that is, 
L=Lé™". 

Here we chose Lo and n to produce crack spacing of 2-7 m for strain rates 

encountered during formation of terrestrial craters with diameters of 1-280 km. 
The BDL model predicts thermal crack widths of 0.3-3 cm and characteristic cooling 
times of 10-1,000s for the equation of state (EOS), strength and conductivity prop- 
erties appropriate for silicates. Frictional heating within each crack is proportional 
to pLéP, where 1 is the friction coefficient (which decreases with temperature) 
and P is pressure. With the crack spacing and widths used here, 0.1-1% of rock 
volume is affected by this process. Surface curvature is an important factor for 
tracking the fate of decapitated projectile fragments from the formation of a basin 
the size of Imbrium, but such a model would be computationally expensive. 
Instead, we modelled the Moscoviense impact to demonstrate the basic processes, 
sequence of arrival, and migrating source regions for impactor debris. 
Basin scaling. With the determined impactor size and reasonably constrained 
impact angle (20° to 30°), well-established crater scaling relations’? can be used 
to estimate the size of the transient (pre-collapse) cavity for Imbrium, both for 
consistency and comparison with observable surface features. 


D,/2r = (p/5)¥?k[(vsind)? /gr) (1) 


Variables are defined as the following: Dg, transient crater diameter limited by 
gravity; r, impactor radius; p, projectile density; 6, lunar density; v, impact speed; 
6, impact angle from the horizontal; k, an empirically derived material-dependent 
constant; a, an exponent dependent on the relative role of momentum and gravity 
controlling excavation; and g, gravitational acceleration of the Moon. The chosen 
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exponents and constants are based on experimental values for near-gravity scaling 
for loose particulates”® with the following: a = 0.51; and k= 1.07, a constant 
(including a shape factor for the diameter). The basis for this approach is that the 
initial collision generates a strong shock that converts the target into loose material 
under extension during the excavation stage. Wet sand scaling also has been used 
to represent gravity-controlled growth of non-porous materials!” with the same 
dimensionless scaling equation but different values for the exponent (a = 0.65) and 
constant (k= 0.8). These two approaches bracket the extrapolations. The average 
speed impact speed for the Moon today is around 23 kms“! but uncertainties 
require consideration of a range of possibilities. Comparison between these two 
scaling relations and the range of impact speeds is summarized in Extended Data 
Table 1. 

The two scaling approaches yield similar values of ~830-860 km (25 km s~') 
and ~700km (15kms~?) for the Imbrium transient apparent diameter (different 
from the final diameter found today). Prior estimates based on the ejected volumes 
of material® yielded a slightly smaller transient diameter of 685 + 88km, which 
could indicate a lower speed impact. More recent estimates”’ based on the annular 
crustal bulge and numerical models also indicate a range of transient diameters 
(~500 km to 740 km) that depends on the thermal profile of the crust at the time. 
In this case, the numerical model assumed both a vertical impact at 15kms~! and 
a much smaller impactor diameter (80 km). Transient diameters estimated by other 
studies can be found in that study. The much larger diameter derived from the 
observations here suggests that three-dimensional models with a curved surface 
are needed to estimate the Imbrium transient crater diameter. 

The derived transient rim diameters from the analytical scaling relations are 
also larger than recent estimates based on the positive gravity anomalies from the 
GRAIL mission and other gravity data®. This contradiction, however, can be rec- 
onciled if the interior gravity anomalies of Imbrium and other basins correspond 
to the shape and diameter of the deformed and displaced lower crust at depth”*”’, 
rather than a transient diameter referenced to the surface. Such an interpretation 
also would account for the offset of the gravity anomalies up range of the basin 
centre relative to the final basin rim, for example, Moscoviense, Orientale, Imbrium 
and Crisium!°, 

Estimates for the transient crater diameters for other basins can be made based 
on other observed impactor diameters using the same approach described here!’. 
For the peak-ring basin Schrédinger, the predicted transient apparent diameter is 
~200km (v=25kms~!, §=30°) based on extrapolation from sand experiments”° 
or 280km for wet soil'’. The observed (post-collapse) rim-rim diameter is 290 km; 
consequently, the latter calculation must represent an overestimate. Decreasing the 
speed to 15kms~! only partly corrects this discrepancy. For Moscoviense (445 km 
in diameter), an assumed impact speed of 25kms~' anda derived impactor diame- 
ter of 100 km in diameter yield nearly the same apparent transient crater diameters 
of 390 km (sand) and 380 km (wet soil). Both estimates, however, are unreasonable 
because rim collapse (25%) would increase the diameter beyond the observed 
diameter. Reducing the impact speed to 10kms~! (matching the hydrocode model) 
results in a rim-rim transient diameter of 350 km and collapsed rim of 440 km, 
which is close to the result from the CTH hydrocode model (485 km) for same 
inputs for projectile size, speed and angle. 

The final diameter of Imbrium depends on the selected physical expression. For 
example, the outer diameter (1,250 km) is delineated from the northern shores of 
Mare Frigoris to the Apennines. It should be pointed out, however, that the tran- 
sient crater diameter here (~830-860 km) is only about 3.5 times the impactor 
diameter. Such a small ratio reflects the role of gravity preventing ejecta from 
escaping the transient cavity. Nevertheless, high shock pressures extended well 
outside this limit and material still remained in motion, even though the crater 
would later collapse. Increasing the apparent transient diameter by 25% (for the 
transient rim-rim diameter) results in the calculated transient rim-to-rim basin 
diameter ranging from 1,040 km to 1,080 km, which would extend from the north- 
west rim of Sinus Iridum (region of first contact) to just inside the Apennines. The 
final crater, however, undergoes collapse. Up range (to the northwest), the transient 
crater profile would have been unstable due to over-steepening of the up-range wall 
and result in greater rim/wall collapse (for example, extending to Mare Frigoris). 
Shallower excavation down range’®, however, would have preserved the uplifted 
rim, now represented by the Apennines. 

Surviving impactor fragments and the Late Heavy Bombardment. Laboratory 
impact experiments can be used to assess the size and state of fragments surviv- 
ing oblique impacts. Oblique impacts into planar targets generate considerable 
down-range debris, largely derived from the impactor as recorded in witness plates 
placed down range’. Estimates for the size of the fragments can be derived from 
a combination of the size of the craters in a vertical witness plate (facing the debris 
down range) and the observed speed (from high-speed imaging) or the momentum 
generated by the collision. These experiments revealed two different processes: 
catastrophic disruption resulting in a strain-dependent power-law fragment size 


distribution; and non-catastrophic disruption following a very different distribu- 
tion revealed only at low impact angles (<15°). The strain dependence on the larg- 
est fragment size resembled the consequences for impact disruption of asteroids, 
the reverse of the case considered here. The non-catastrophic distribution, however, 
resulted in 5-8 similar size fragments travelling very close to the initial impactor 
speed. The sizes of these largest isolated fragments do not depend strongly on strain 
rate, consistent with these fragments formed by a different process (for example, 
shear) 4, 

The derived sizes, however, do not represent the initial fragment dimensions 
following the first contact because they impacted the surface down range and 
fragmented still further, before impacting the witness plate. Hence, this approach 
underestimates the true projectile-fragment size distribution, except at the lowest 
impact angles (<10°). Impacting a sphere or cylinder, however, prevents some frag- 
ments impacting the surface down range'®. Such experimental designs reveal that 
(1) some large fragments avoid re-impact with trajectories only slightly deflected 
from the initial trajectory, and (2) impacts into surfaces with sufficient curva- 
ture expose their true physical state and size following first contact (planar targets 
designed to avoid down-range impacts near the primary crater also preserve the 
original fragment sizes). 

We further illustrate these results in a sequence of images that capture the 
size, trajectory and physical state of projectile fragments revealed in the high- 
speed image sequence shown in Fig. 2b and more completely in Extended 
Data Fig. 7. The 0.635-cm-diameter aluminium sphere impacted a 6.7-cm- 
diameter disk. Although the launch angle was actually 30° (from the horizontal), 
the impact point is slightly down range from the top, thereby resulting in an effec- 
tive impact angle of 20° from the surface tangent on top of the disk. The collision 
resulted in about ten large fragments following close to the original trajectory, 
behind the vapour plume and jetting plasma. The largest fragment is about 0.2cm 
in diameter, among other fragments 0.1-0.2 cm, consistent with measurements 
from impacted witness plates’*. Smaller fragments leading this debris are cold (in 
silhouette) and near the initial trajectory, while others trail at slightly higher angles. 
In addition, a reddish glow surrounds this debris to either side of the trajectory 
and corresponds to a cloud of melt droplets. This distribution of debris is reflected 
in the arrival sequence observed on the witness plates with high-speed imaging. 

The physical state of the Imbrium impactor after the collision depends on the 
impact angle and whether or not it fully decouples before hitting down range. In 
general, the peak pressure should reduce as (sin@)” of the level for a vertical impact. 
For a 30° impact from the horizontal, this angle dependence means that the peak 
pressure would be reduced by a factor of 4. Two additional processes reduce the 
pressure even further. First, the density of the upper crust of the mega-regolith of 
the Moon reduces the impedance, thereby further reducing the peak pressure in 
the impactor. Second, the response in the impactor is more complicated than in 
the target because of the large amount of free surface, which reduces the expected 
peak pressures even further, along with the role of shear failure’*. A prior mod- 
elling study*” for a 5 km diameter at impact speeds comparable to this study for 
Imbrium (20kms!) calculated peak pressures in the projectile during impacts into 
planar targets and found that a 30° impact allows 16% of the projectile to remain 
as a solid, whereas a 15° impact allows 100% to survive, generally consistent with 
laboratory experiments!*. 

At the scale of the Imbrium basin, the widths of the parallel grooves down range 
provide a measure of the debris diameters based on crater scaling. At very low angle 
impacts (<5°) with speeds slightly less than the initial impactor speed, gravity-scal- 
ing relations for 10kms-! debris reveal that a 20-km-wide groove would have 
been produced by debris 4.5 km in diameter (equation (1)). At such low impact 
angles, however, groove widths should be controlled by strength scaling relations, 
which would predict even greater sizes for the debris. Grooves extending from 
near the region of first contact range from 10 km to 20km across. Consequently, a 
conservative estimate predicts that some of the larger down-range fragments from 
the Imbrium impactor were conservatively 2 km to 4.5 km across, that is, less than 
1-2% of the original diameter. Because a longer groove was probably not the result 
of a single impactor, grooves could represent more than one fragment while others 
could have escaped the Moon entirely. 

If the impacting bodies were differentiated, the core would have a different 
distribution relative to the crust. For example, if the Moscoviense impactor was dif- 
ferentiated into a core and mantle, three-dimensional models reveal that projectile 
remnants line the interior ring while the crustal component is generally distrib- 
uted beyond the rim”. This contrasts with prior results for the South Pole-Aitken 
basin, where the core is distributed beyond the rim*!. The difference in distribution 
reflects the size of the impactor relative to the radius of the Moon. 

More generally, recent studies indicate that ancient regolith breccias sam- 
pled at the Apollo 16 landing site have similar compositions (highly magnesian) 
poorly represented in current meteorite collections”!. After the LHB, meteoritic 
contamination in the Apollo regolith shifted to a greater range of compositions. The 
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results of the study presented here suggest that the Imbrium impactor would have 
dispersed its signature across the Southern Highlands and account for the observed 
unique meteoritic contamination of Apollo 16 regolith preserved in regolith brec- 
cias, reflecting post-Imbrium reworking*”. 

Consequently, large basin-forming collisions at angles less than 35° on the 
Moon (or any planet) should have generated considerable amounts of debris just 
from the surviving impactor mass that escaped the gravity field. The experiments, 
models and observations of groove width all show that this component should be 
composed of large masses (or weakly held masses) departing at speeds nearly the 
same as the original impact speed and along the original trajectory. This contrasts 
with excavated debris from the parent body that would have been heavily shocked 
to achieve comparable speeds and ejected over broad angles. Our observations 
indicate that the basin-forming impactors are much larger than those previously 
assumed to accommodate the surviving asteroids in the Hungaria region”!. One 
important implication, then, is that this picture may be incomplete. For example, 
the E-belt dynamical model assumed an arbitrary number of test particles and then 
compared the fit of those dynamically surviving particles constrained by assump- 
tions about crater and basin scaling. If our assessment is correct, then asteroids in 
the Hungaria region may be only a small fraction of the original total mass (and 
sizes), but not necessarily the number of bodies. Either the basin-forming bodies 
during the LHB did not all come from the proposed E-belt®?, or the E-belt was 
originally composed of larger bodies that underwent collisional fragmentation 
and dispersal during planet migration, a process that was acknowledged to be 
missing in that study. 
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Extended Data Figure 1 | Mapped rings and features of the Imbrium 
basin that reveal asymmetries due to the impact trajectory and 

identify features for reference in the discussion. The rings comprising 
the Imbrium basin (1), including an outer ring, corresponding to the 
Apennines (OR), a middle ring extending to the Alpes Mountains 

(MR), and an oblong interior ring (IR) delineated by isolated massifs 
(mountains) and wrinkle ridges. An additional scarp-ring extends farther 
to the southeast (dotted line). The poorly expressed outer ring to the 
northwest is detached from the boundary to the southeast due to enhanced 
rim collapse in the up-range direction. The middle ring is incomplete 


but is recognized by isolated massifs and re-direction of tectonic features 
(wrinkle ridges). The inner ring is oblong (northwest-southeast) but 
truncated to the southeast. Nevertheless, wrinkle ridges do extend along 
the proposed trajectory. Such a pattern is consistent with portions of the 
impactor being decapitated after first contact but continuing to interact 
with the near surface down range. The Apollo 15 landing site at the base 
of the OR is also shown. ‘S indicates the Serenitatis basin. (Base map is 
an orthographic projection of Lunar Orbiter Laser Altimeter (LOLA) 
reproduced with permission from M. Collins.) 
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Extended Data Figure 2 | Mapped trends corresponding to grooves, 
scours and lineations in different directions around the Imbrium 
impact basin on the Moon. a, Region south-southeast of the Imbrium 
basin centred (latitude —6.693, longitude 6.491) near the trajectory line 
with the degraded crater Hipparchus in the right centre. White solid lines 
illustrate trends that converge up range of the basin centre (close to the 
point of first contact), whereas yellow dashed line trends converge closer 
to the centre of Imbrium (crater excavation). b, Region farther west from 
the trajectory line for Imbrium and south of Fig. 1a (centred at latitude 
—14,823, longitude —0.798). Certain grooves (white) have trends directed 
northwest (source up range of basin centre), and indicate first-arriving 


LETTER 


debris, in contrast to other sets (yellow), which are radial from the basin 
centre and correspond to later-arriving material. The crater Alphonsus is 
in the top left. c, Grooves and elongate secondaries near the Julius Caesar 
crater (bottom centre) and Boscovich craters (bottom left) southeast 

of Imbrium (latitude 12.496, longitude 15.681). Grooves/secondaries 
trending more northwest (solid white lines radial to the Imbrium) 
superpose those trending more westerly (dashed yellow lines converging 
on the proposed point of first contact). d, Region northwest (up range) of 
Imbrium (latitude, 68.304, longitude — 114.03). The crater at the very top 
(only partial) is named Niepce. Most sets here indicate a source region 
towards the basin centre. 
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Extended Data Figure 3 | Mapped lineations extended as great circles 
into the Imbrium basin to assess sources during formation. a, Great 
circles extended from trends established by over 230 mapped grooves and 
elongate secondary craters related to the Imbrium basin (white circle). 
Locations of different Apollo landing sites are also identified. Lambert- 
conformal stereographic projection centred on Imbrium. b, Subset of 


trends from down-range secondaries that cross the trajectory more than 
300 km northwest of the Imbrium centre. c, Subset of trends north of 
Imbrium intersecting the trajectory line well up range from the basin 
centre. d, Subset of great circles that cross the inferred Imbrium trajectory 
within a 300km radius of its centre. The most distant secondaries have 
trends that converge slightly up range. 
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Extended Data Figure 4 | Subset of great circles of trends associated the impactor (Fig. 3c). Circles correspond to basin rings defined by relict 
with down-range secondaries and grooves that cross the trajectory massifs and wrinkle ridges. IR, inner oblong ring; MR, middle ring; OR, 
300 km up range of its centre. One subset forms trends nearly paralleling outer ring, which is delineated by two different arcs resulting from the 
the trajectory, consistent with sheared impactor fragments. Coloured up-range rim collapse. 

lines orthogonal to the trajectory (black) are used to constrain the size of 
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Extended Data Figure 5 | Evolution of trajectories from hypervelocity velocimetry) were captured in multiple cameras to establish in-flight 
oblique impact experiments, illustrating the evolving source region trajectories at different times of crater growth. Modified from ref. 13 (a) 
during excavation. a, Early trajectory. b, Late trajectory. Packets of ejecta and ref. 11 (b). 


passing through a horizontal laser beam (using three-dimensional particle 
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Extended Data Figure 6 | Strategies for determining the position of 

the impact point from mapped patterns on a target surface. 

a, Mapped trends of grooves and elongated trajectories down range that 
do not converge near the crater centre allow estimation of the size of the 
projectile. Convergence of trends from the up-range direction constrains 
the region of first contact. Convergence of trends intercepting a line 
orthogonal to the trajectory at the location set by the region of first contact 
constrains the diameter of the projectile. b, c, Convergence diagrams for 
15° (b) and 30° (c) hypervelocity impacts into aluminium by 0.635 cm 
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aluminium spheres onto planar targets. Grooves created by hypervelocity 
projectile debris converge and cross a reference line perpendicular to 

the trajectory at the up-range rim of the crater (line A) and 1.5 projectile 
diameters up range (line B). Well beyond the up-range rim (line B), the 
spray originates from spalls and shears from either side of the projectile 
and travels down range with little relative velocities directed away from 
the initial trajectory. Therefore, these components can be used to estimate 
the impactor diameter, consistent with 0.635 cm (indicated by the scale bar 
below). 
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Extended Data Figure 7 | Fragmental debris resulting from a 0.635 cm 
aluminium sphere impacting a 6.7 cm aluminium disk at ~5kms—!. 
a-d, The launch angle was 30°, but because the impact point was not at the 
top of the disk, the actual impact angle was 20°. a, The top left frame shows 
the jetting phase (blue-white tongue-like extension). b, The jetting phase 
(151s later) shows the jetting phase, now a faint blue haze, and vapour 
(turbulent blue gas) with fragments emerging near and above the yellow 
dashed line (the crater seen below the impact on the disk was from a prior 


experiment). c, Still later, leading fragmental debris down range (near 

the yellow dotted line immersed in plasma) separate from the vapour 
phase along the initial trajectory. d, The fourth frame shows 8-10 large 
fragments (1-3 mm across) travelling close to the same speed as the initial 
impactor. Experiments were performed without artificial illumination and 
shows that the fragments are not notably heated. The aluminium sphere is 
represented graphically, and the aluminium disk is faintly outlined. 
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Extended Data Figure 8 | Comparing results of the CTH numerical (not shown) cover this evidence as identifiable units but their effects 


model and the Moscoviense basin on the Moon from corresponding (grooves) remain preserved. b, The shaded relief map from the Lunar 
LOLA-DEM data without mapped lineations. a, The CTH model tracked _ Laser Altimeter is shown here without the mapped lineations shown in 
where impactor fragments intersected the surface (in red). Beyond the Fig. 3b. Mapped lineations in Fig. 3b were based not only on topographic 
rim, the linear red areas point towards the region of first contact, rather data but available images from Lunar Orbiter, Apollo and LROC wide- 


than the centre of the basin. See Fig. 3a for mapped lineations. Later ejecta. angle cameras. 
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Extended Data Table 1 | Comparison of selected transient basin diameters calculated from two scaling equations 


Assumed metal til Impactor 
Basin P rim diameter - pat Sand* Wet Sand* 
impact speed (km) diameter* (km) 
Dia (km) Diz (km) Dia (km) Diz (km) 

Schrddinger 290 45 

25 km/s 200 (250) (280) (440) 

15 km/s 170 210 230 350 
Moscoviense 445 100 

25 km/s (390) (480) (380) (480) 

15 km/s 320 (400) (340) (420) 

10 km/s 280 350 280 350 
Orientale 930 110 

25 km/s 420 520 460 570 

15 km/s 350 440 360 460 
Imbrium 1250 250 

25 km/s 830 1040 860 1080 

15 km/s 700 880 690 860 


Dia is the apparent transient crater diameter (referenced to the pre-impact surface). Dir approximates the rim-rim transient crater (pre-collapse); rim collapse increases the diameter by another 25%. 
Parentheses indicate unreasonable values. Impact angles are assumed to be at 30° (except where noted). Values are rounded to two figures. 

*Estimates for impactor diameter for Schrodinger and Orientale are from ref. 17. 

tConstants and exponents from ref. 27. 

Constants and exponents from ref. 19. 
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Electron attraction mediated by Coulomb repulsion 


A. Hamo!, A. Benyamini!*, I. Shapir!, I. Khivrich!, J. Waissman!+, K. Kaasbjerg!t, Y. Oreg!, F. von Oppen? & S. Ilani! 


One of the defining properties of electrons is their mutual Coulomb 
repulsion. However, in solids this basic property may change; for 
example, in superconductors, the coupling of electrons to lattice 
vibrations makes the electrons attract one another, leading to the 
formation of bound pairs. Fifty years ago it was proposed! that 
electrons can be made attractive even when all of the degrees of 
freedom in the solid are electronic, by exploiting their repulsion 
from other electrons. This attraction mechanism, termed ‘excitonic, 
promised to achieve stronger and more exotic superconductivity?®. 
Yet, despite an extensive search’, experimental evidence for excitonic 
attraction has yet to be found. Here we demonstrate this attraction 
by constructing, from the bottom up, the fundamental building 
block? of the excitonic mechanism. Our experiments are based on 
quantum devices made from pristine carbon nanotubes, combined 
with cryogenic precision manipulation. Using this platform, we 
demonstrate that two electrons can be made to attract each other 
using an independent electronic system as the ‘glue’ that mediates 
attraction. Owing to its tunability, our system offers insights into 
the underlying physics, such as the dependence of the emergent 
attraction on the underlying repulsion, and the origin of the pairing 
energy. We also demonstrate transport signatures of excitonic 
pairing. This experimental demonstration of excitonic pairing paves 
the way for the design of exotic states of matter. 

Following the development of the Bardeen—-Cooper-Schrieffer (BCS) 
theory of superconductivity’, W. A. Little’ proposed the idea that two 
electrons can attract each other not via phononic-based attraction, 
but via their repulsion from other electrons. He predicted that if elec- 
trons, which are much lighter than ions, mediate the attraction, then 
the electron pairing would be much stronger than in conventional 
superconductors. To test this new form of attraction, Little proposed 
a one-dimensional conducting organic chain (Fig. la, green) that has 
an array of polarizable sidechains (‘polarizers; purple). Each polarizer 
has a single electron that can hop between two sites—one closer to and 
one further away from the main chain. Owing to Coulomb repulsion, 
an electron travelling down the main chain polarizes the side chains, 
which in turn attracts another electron in the main chain. This mech- 
anism rapidly became popular in attempts to engineer unconventional 
superconductivity; it was extended to two dimensions~*, generalized 
to attraction at localized sites'”"“ and used in early attempts to explain 
high-T. superconductivity’’. It is considered a candidate for the unusual 
superconductivity'®!” and pairing'® observed in SrTiO; interfaces, 
and it has analogues in optical systems’. Numerous attempts have 
been made to directly synthesize organic materials that have the 
essential microscopic components’; however, so far there has not 
been any experimental evidence for an excitonic attraction between 
electrons. 

At the core of Little’s proposal is the idea that electrons separate 
into two groups: some form the ‘systeny (Fig. 1a, green), where their 
mutual interaction is to become attractive, while others make up 
the surrounding ‘medium’ (purple) that produces the ‘glue’ for the 
attraction. To make electrons attractive, this medium should perform 


the unusual feat of flipping the sign of the potential generated by 
the system electrons, making them look like holes to other system 
electrons (Fig. 1b). This suggests that the medium should effectively 
have a negative dielectric constant. Little suggested that this property 
can be achieved in the dynamic (retarded) limit, in which the sys- 
tem electrons respond faster than the medium, thus leaving a positive 
polarization cloud in their wake that creates an exponentially weak 
BCS-like pairing. However, Hirsch and Scalapino showed? that the 
interesting regime of Little's model is instead the static (instantaneous) 
limit. Only in that limit does superconductivity dominate over 
competing orders (such as charge and spin density waves) and the 
superconducting phase exhibit strong Bose-Einstein-condensation- 
like pairing. 

Here we demonstrate that excitonic pairing is indeed possible 
and that the key ingredient is the discreteness of the electrons in the 
medium. We take a bottom-up approach and construct the minimal 
building block of Little’s model that features excitonic attraction’—a 
two-site system and a single polarizer (Fig. la, dashed circle)—and 
show that when the polarizer interacts strongly with the system, it can 
render its electrons attractive. 

The system and polarizer are created within two separate carbon 
nanotubes, each on its own microchip. We use our recently devel- 
oped nano-assembly technique” to suspend each nanotube between 
two metallic contacts and above an array of gates (Supplementary 
Information section S1). In both devices we bias the gates to produce a 
double-well electrostatic potential along the nanotube length (Fig. 1c). 
In the polarizer device, the energy levels in the two wells are placed far 
from the chemical potential in the leads, but close to each other, such 
that a single electron can hop between the wells, but cannot escape to 
the leads. The polarizer thus operates as an isolated dipole whose sole 
degree of freedom is the polarization of its electron (Supplementary 
Information section $2). Conversely, in the system device, a large 
central barrier inhibits tunnelling between the two wells, but small 
side barriers allow electrons to enter from their corresponding leads. 
On a separately contacted side segment of the system nanotube, we 
create an independent quantum dot that serves as a charge detector, 
detecting the population of individual system electrons via weak elec- 
trostatic coupling. We then mount the two chips, oppositely facing 
and perpendicular to each other, in a custom-built scanning probe 
microscope inside a dilution refrigerator (Fig. 1d). The microscope 
allows us to control the distance and the coupling between the polar- 
izer nanotube and the system nanotube (Fig. 1d inset) and, hence, to 
test whether the polarizer alters the behaviour of the system electrons 
in a fundamental way. 

We first measure the charge stability diagram of the bare system, 
without the polarizer. Using the left and right gates of the system, we 
scan the potential detuning between the two wells, 8V=(V,— Vr)/2, 
and their mean potential, V=(V, + Vp)/2. Changes in the electronic 
occupation of the system appear as steps in the charge-detector cur- 
rent, Icp (Supplementary Information section $2). This measurement 
(Fig. 2a) yields the well-known charge stability diagram of a double 
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Figure 1 | Model system and experimental realization of its 
fundamental building block. a, The organic-molecule model system 
proposed by Little’, consisting of two parts: the ‘system; a one-dimensional 
conducting chain (individual lattice sites marked in green), and the 
‘medium, an array of side-chain ‘polarizers’ (purple) each having a single 
electron that can hop between a site close to, and a site further away 

from, the chain. The fundamental unit block that manifests attraction is a 
two-site system with one polarizer (dashed circle). b, In a bare electronic 
system (top) an electron creates a repulsive Coulomb potential (green). 
Embedding it in a medium that flips the sign of its potential (bottom) will 
make this electron attractive to other electrons. c, Implementation of the 
two components (‘polarizer’ and ‘system’) that make up the fundamental 
building block. These are fabricated as two separate devices, each having 
a pristine nanotube assembled on contacts (yellow) and suspended above 
an array of gates (blue), which set the potential landscape for the electrons 
(grey) (see Supplementary Information section S1 for dimensions). The 
polarizer device (top) has two gate voltages, Vg and V+, that control the 


quantum dot with repulsive electrons (explained in Supplementary 
Information section S3). Tilted charging lines reflect the addition ofa 
single electron to the right or left dot. Near their crossing, the charging 
lines exhibit a vertical shift (Fig. 2a, dashed black line), reflecting the 
fact that if one dot is occupied the energy to populate the other dot 
is increased by the Coulomb repulsion between the neighbouring 
electrons. This vertical shift is thus a direct fingerprint of the nearest- 
neighbour electron repulsion, and its magnitude normalized to energy 
units yields the strength of this repulsion, W = 830 eV. 
Fundamentally different behaviour is observed when the polarizer 
is brought into close proximity to the system. In this case, we measure 
a charge stability diagram with an interaction line that is rotated from 
vertical to horizontal (Fig. 2b). This rotation implies that the polarizer 
inverts the interaction between the system electrons from repulsive 
to attractive. The inversion is best understood by realizing that the 
ground states at the centre of the two charge stability diagrams are 
fundamentally different: in the bare system it is a degenerate state 
between having a single electron on the left or the right dot (labelled 
(1, 0) and (0, 1)); with the polarizer nearby it becomes a paired ground 
state having a degeneracy between the two states with an even electron 
number (labelled (0, 0) and (1, 1)). The odd states, (1, 0) and (0, 1), 
become excited states, separated from the ground state by a pairing 
gap A. This gap is maximal at the centre of the horizontal vertex, 
where its magnitude is given by the length of this vertex (2A after 
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potentials of the bottom and top dots (purple), and is operated as an 
isolated dipole, whose sole degree of freedom is that of a single electron 
hopping between the sites. The system device (bottom) has two gate 
voltages, Vj, and Va, that control the potentials of the right and left dots 
(green). Here the central barrier is opaque and the side barriers are 
relatively transparent such that electrons enter the two dots from their 
corresponding leads. An additional dot on a side segment of the same 
nanotube (blue) serves as a single electron transistor charge detector: 

a voltage bias across it, Vcp, leads to current, Icp, that is sensitive to the 
population of the system dots through weak electrostatic coupling. 

d, A custom-built scanning probe microscope (centre), operating inside 

a dilution refrigerator, brings the two oppositely facing devices into 
proximity (about 100-150 nm apart) such that the polarizer nanotube is 
perpendicular to the system nanotube (right) and such that one of its dots 
is directly above the system while the other is further away (left), creating a 
structure that is analogous to that of Little’s molecule. 


normalization; see Supplementary Information section S4), reaching 
A=790peV ~8K. 

A medium can transform repulsive system electrons to attractive 
electrons only if it can flip the sign of the electrostatic potential pro- 
duced by these electrons. The Coulomb potential of a bare electron is 
positive in its local and neighbouring sites, implying on-site and near- 
est-neighbour repulsion (Fig. 2c). If the on-site repulsion is retained 
while flipping the potential sign at the nearest-neighbour site, then 
the system remains stable, but acquires nearest-neighbour attraction. 
However, any medium based on continuum classical electrostatics can 
at best screen a potential to zero, but cannot flip its sign (Fig. 2d). We 
therefore seek to understand what the special feature of our polarizer 
medium is that enables such sign inversion. 

The key element for a sign-inverting medium is charge discreteness”! 
and, more fundamentally, the fact that a single electron does not repel 
itself. To understand this, compare the screening by a single-electron 
dipole to that of a metallic object of similar geometry (Fig. 2e). Ina 
small external field, the charge in the metal will polarize slightly to 
exactly null the internal field. Conversely, in a dipole with small inter- 
site tunnelling, an entire electron will polarize between the two sites, 
creating an internal field that is much larger than, and of opposite sign 
to, the external field. This behaviour is the basis for our basic ‘sign- 
inverter. Unlike continuous charge in a metal that experiences its own 
field and thus minimizes the electrostatic energy by nulling the internal 
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Figure 2 | From repulsive to attractive electrons. a, Measured charge 
stability diagram of the bare system: charge detector current (Icp, 

colour scale) plotted as a function of the voltage detuning between its 
right (R) and left (L) sites, ’V=(V, — Vp)/2, and the mean gate voltage, 
V=(V,-+ Vp)/2. Steps in Icp (dashed grey lines) correspond to single 
electrons populating the L/R sites (green/white circles label an electron 
presence/absence). The middle vertical shift (black dashed line) is a direct 
measure of the Coulomb repulsion between the neighbouring electrons 
(see text). b, Similar charge stability diagram to a, but with the polarizer 
nanotube positioned nearby (approximately 125-nm separation between 
the nanotubes). The interaction vertex is now horizontal, reflecting an 
attraction between the electrons. Along this line (dashed black) the ground 
state is degenerate between the two even states (0, 0) and (1, 1) ((mz, mp) 
represents the number of electrons in the L and R dots), whereas the odd 
states (1, 0) and (0, 1) are the excited states, separated by a pairing gap A 
from the ground state. This gap is maximal at the centre of the horizontal 
line, and its magnitude there is equal to the length of this line (2A when 
normalized to energy units; see Supplementary Information section $4). 
Note that the charge detector is far away from the right dot and that when 


field, a single electron does not feel its own field and so can generate 
a large over-screening internal field, as long as this minimizes its 
single-particle energy. 

To analyse the sign inversion microscopically, for the geometry of 
our system, we describe the polarizer by the Hamiltonian 
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are creation (annihilation) operators of electrons in the two polarizer 
sites (“T; top; ‘B, bottom), 6= ey — €g is the energy detuning between 
these sites, ft is the tunnelling amplitude, and we ignored a constant 
energy offset. The repulsion between system and polarizer electrons 
leads to a charge-dipole coupling 
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the polarizer is close their mutual capacitance is strongly screened and so 
the charging lines of this dot are harder to observe. c, In the bare system, 
an electron populating the L site generates a Coulomb potential (green) 
that is positive (repulsive) in both the L and R sites. d, Embedding the 
system in a medium based on continuum electrostatics (purple) can at 
best screen the potential to zero far away from the electron, but cannot 
flip its sign. e, The key element for sign inversion is charge discreteness: 

a single-electron dipole (right) and a similarly shaped metal (left) will 
screen an external field differently (grey arrows). In the latter the internal 
field is nulled, whereas in the former it is larger and of opposite sign to 
the external field (red arrows; over-screening). This behaviour is rooted 
in the fact that an electron does not repel itself (see text). f, With a nearby 
polarizer, an electron charging the system gets dressed by the polarization 
(grey ellipse). The electrostatic potential (green) of the dressed particle 
will be substantially different to that of the bare electron (calculated in 
Supplementary Information section S4). Note that all the measurements 
presented here were done with single holes instead of single electrons, but 
to avoid unnecessary confusion we presented the physics in the language 
of electrons. 


in which ny, and ng are the populations of the left and right system sites, 
U is the repulsion between an electron in one of these sites and an 
electron in the bottom site of the polarizer (assuming negligible inter- 
action with the top site), and f = Wiab + Whabp. This coupling dresses 
an electron entering the system with a dipolar excitation of the polarizer 
(Fig. 2f). In the strong coupling limit® (U > ¢, applicable to the meas- 
urements in Fig. 2b), the electron is dressed by a fully polarized dipole, 
and the effective potential of the dressed particle is considerably differ- 
ent from that of the bare electron (Supplementary Information section 
S4). If the charge—dipole coupling exceeds the charge-charge repulsion 
within the system 


Fyrepulsion = Wnyng (3) 
that is, U> W, then the dressed potential at the neighbouring site has a 
flipped sign, implying that another electron is attracted to the dressed 
particle. This attraction reflects the fact that the neighbouring electron 
is more strongly attracted to the vacancy formed in the bottom site of 
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Figure 3 | Dependence of pairing energy on the polarizer detuning and 
the origin of the pair binding energy. a—c, Charge stability diagrams 
similar to those in Fig. 2b, measured for different energy detunings of 

the polarizer: 6=0.39 meV (a), 1.01 meV (b) and 2.57 meV (c). The 
observed attraction increases linearly with 6 (more data in Supplementary 
Information section $6). The insets show the polarizer potential wells for 
the different detuning values. d, e, Rationalizing this observation using 


the polarizer than it is repelled by the original electron that triggered 
the polarization. 

A central prediction of the above model is a direct relationship 
between the emergent attraction and the underlying repulsion. We 
test this prediction by changing the separation between the nano- 
tubes, which modifies the repulsion between system and polarizer 
electrons (U) while keeping the repulsion between the system electrons 
(W) fixed. The measured dependence of 2A on U (Supplementary 
Fig. 6) shows a linear dependence with unit slope. This result follows 
from equations (1)-(3) in the strong coupling limit (Supplementary 
Information section $4) and is a demonstration that the observed 
attraction is driven by repulsion. 

Because the ability of the polarizer to dress an electron depends on 
its polarizability, one might expect that the optimal polarizer detuning 
for creating the strongest pairing would be at zero detuning (5=0). At 
this detuning, the electronic wavefunction of the polarizer is symmet- 
rically split between its top and bottom sites and its polarizability is 
maximal. However, the measured charge stability diagrams at different 
detunings (Fig. 3a—c) show that the attractive correction to the bare 
repulsion is not maximal at 6=0, but rather increases linearly with 
6. This observation captures an important point about the origin of 
the pair binding energy, which is most easily rationalized by consid- 
ering a toy model of a multisite chain (Fig. 3d, e): two far-apart elec- 
trons on the chain would each be dressed by two adjacent polarizers 
(Fig. 3d). In contrast, nearest-neighbour electrons share one polarizer 
and thus gain the polarization energy of one polarizer (Fig. 3e). This 
energy gain constitutes the pair binding energy, which is exactly equal 
to 6, the stored energy in a polarizer. This is indeed what we observe 
experimentally. The same mechanism of sharing the dressing costs, 
leading to a pairing energy of magnitude 6, also holds in our two-site 
case, as we explain in Supplementary Information section S6. 

Beyond the ground-state stability diagram, we also examine whether 
pairing is reflected in transport, that is, whether electrons enter the 
system in pairs. In our set-up, we can measure the current through the 
polarizer and deduce the behaviour of the system via its strong correla- 
tions with the polarizer. To simplify the measurement, we open the bar- 
rier to one of the polarizer dots to strongly couple it to the lead, making 
the polarizer effectively a single quantum dot coupled to two leads 
(Fig. 4a). Now, the polarization is between dot and lead rather than 
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a toy model of the chain: two spatially separated electrons in the chain 
are each dressed by the polarization of their two adjacent polarizers (d); 
when the electrons are nearest neighbours, they share the centre polarizer 
and thus need to polarize one fewer polarizer (e). The energy gain, which 
gives the pairing energy, is consequently equal to 6, as we observe in the 
experiments in a-c. A similar polarizer-sharing argument holds for the 
two-site case, as is explained in Supplementary Information section S6. 


between two dots. Although these two cases are electrostatically quite 
similar, they are fundamentally very different: whereas the excitonic 
polarization of an isolated double dot preserves the overall fermion 
parity of the combined system and polarizer, tunnelling of an electron 
to alead in the polarizer does not. Yet, establishing pairing also in the 
latter configuration opens possibilities to create quantum systems with 
engineered dissipation, which may help to stabilize one-dimensional 
superconductivity~. 

Figure 4b shows the measured polarizer current, Tob with source-— 
drain bias Vsp = —1.3 mV as a function of its local gate voltage, Vp, 
exhibiting a standard Coulomb blockade peak when the occupation 
of the polarizer dot is free to fluctuate. Owing to the strong coupling 
to the system, the peak position depends strongly on the electronic 
occupation of the system (Fig. 4b). If we fix Vp at 241.5 mV (dashed 
line in Fig. 4b), then the polarizer conducts for the odd system occupa- 
tions (1, 0) and (0, 1), but is blocked for the even states. The Coulomb 
peak is pushed well below the Fermi energy for the (0, 0) state and well 
above it for the (1, 1) state. This behaviour persists throughout the 
charge stability diagram of the system (Fig. 4c, Vsp = 100 1V), show- 
ing finite current (red) for odd states and negligible current (blue) 
for even states. Notably, the attraction survives even in the dynamic 
regime with tunnelling rates being a substantial fraction of the repul- 
sion (Ipo/ U0.2, [R/U 0.45 and [1/U 0.23, where I} x pol are the 
tunnelling rates to the left and right contacts of the system and to 
the polarizer contacts, respectively). Although neither the (0, 0) nor 
the (1, 1) states support single-particle current, we observe a finite cur- 
rent peak along their degeneracy line”*4 (indicated by the arrow in 
Fig. 4c). Theoretical calculations (Fig. 4d) reproduce this peak only 
when including correlated many-body events in which an elec- 
tron hopping in and out of the polarizer is accompanied by a pair of 
electrons simultaneously co-tunnelling out of and into the system 
(Supplementary Information section $7). 

To better understand the processes underlying the observed current 
peak, we measure the current along a cut perpendicular to the degen- 
eracy line, as a function of aV (where a, the independently measured 
lever-arm, normalizes V to energy units) and for varying Vsp (Fig. 4e). 
In different bias ranges we observe conductance regimes that differ 
in their turn-on slope, s = dVsp/d(aV), which reflects the number of 
system electrons participating coherently in the dominant conductance 
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Figure 4 | Transport measurements. a, Experimental configuration. The 
polarizer current, [,01, is measured with a finite bias on one lead, Vsp. The 
barrier between this lead and its nearby dot is reduced such that the dot 
becomes strongly connected to the lead, effectively making the polarizer 
nanotube a single quantum dot device with two leads. Compared to the 
experiments presented in Figs 2 and 3, in which the polarization occurred 
internally between two dots, here the polarization is between the dot 

and its lead. b, J). measured as a function of the local gate voltage, Vp, 

for four different population states of the system: blue, (0, 0); red, (1, 0); 
orange, (0, 1); green, (1, 1). In these measurements, the system charge 
remained fixed by staying far away from the charging lines of the system. 
This measurement was performed at a high bias, Vsp = —1.3 mV, which 
widens the observed charging lines. The vertical dashed line indicates the 
gate voltage Vx used in c. ¢, I,,) (colour scale) measured at Vsp = 100 1V 
as a function of the voltage detuning between the L and R system sites, 
5V=(V, — Vr)/2, and the mean voltage, V=(V,+ Vp)/2. Finite current 
(red) is observed for the odd system states, (1, 0) and (1, 0), and negligible 
current (blue) is measured for the even states, (0, 0) and (1, 1), except for 
a special peak of finite current (indicated by the white arrow) appearing 
along their degeneracy line. d, A theoretical master-equation calculation 


process (s=0, 1 and 2 corresponds to processes with zero, one and 
two system electrons, respectively). At high bias, Vsp > 925 1eV & U, 
we observe s 0 (Fig. 4e, black line), showing that a polarizer electron 
with enough energy to overcome its interaction with the system elec- 
trons (U) can flow without their cooperation. At intermediate bias, 
Vsp > 490 eV, the observed slope is s+ 1 (Fig. 4e, blue line), reflecting 
simultaneous co-tunnelling of one system electron. Below this, and 
down to zero bias, the slope is s +2 (Fig. 4e, red line), indicating that at 
low energies a pair of electrons co-tunnels into the system in concert 
with an electron tunnelling out of the polarizer. Theoretical calcula- 
tions (Fig. 4f and Supplementary Information section $7) reproduce 
the main features, although they give a smaller two-electron current 
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of Ipoi for the parameters of the experiment (Supplementary Information 
section $7). The theoretical result features a finite current peak on the 
degeneracy line between the (0, 0) and (1, 1) states only when considering 
correlated processes that involve co-tunnelling of a pair of electrons into 
the system in concert with an electron tunnelling out of the polarizer. 

e, Ibo (colour scale) measured along a line cutting through the centre of the 
degeneracy line as a function aV and Vep (the independently measured 
lever-arm, a = 0.61, normalizes V to energy units). Three regimes appear 
at different bias ranges, differing in their turn-on slope, s=dVsp/(aV), 

the value of which (s = 0, black line; s ~ 1, blue line; s = 2, red line) reflects 
the number of system electrons participating in the dominant transport 
process (0e, le or 2e, respectively) (see Supplementary Information 
section $7). f, Theoretical master-equation calculation of Iho versus aV 
and Vgp with only a 0e system process considered (left), with 0e + le 
processes (middle) and with 0e + le + 2e processes (left). The 2e processes, 
which involve a pair tunnelling in the system, are the dominant processes 
at low bias, as is observed experimentally, although their calculated 
amplitude is lower than in the experiment, probably because the theory 
considers them to only the lowest order in tunnelling (Supplementary 
Information section $7). 


than is observed experimentally, probably because they consider these 
processes to only the lowest order in tunnelling. 

Our work experimentally establishes the fundamental concept of 
excitonic pairing of electrons that was theorized half a century ago. 
The ability to construct the basic building block of electronic attrac- 
tion raises many questions regarding the possibility of engineering 
exotic states of matter by generalizing this concept to larger systems, 
for example, whether it would be possible to create an artificial super- 
conductor, what kind of pairing such a superconductor would have, 
and whether it would be stable against competing ground states (see 
Supplementary Information section S8 for further discussion). The 
benefit of repulsion-driven attraction is that the pairing energy of the 
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attraction increases linearly with decreasing device dimensions. Our 
experiments achieved pairing energies of about 8 K with rather large 
quantum dots (approximately 400 nm). By extrapolating this value to 
the nanometre scale, we postulate that it should be possible to reach 
energies well in excess of room temperature. Given the tremendous 
progress in engineering quantum dots in two-dimensional semicon- 
ductors**-*8, down to almost single-atom scale”’, the possibilities to 
engineer interesting states of matter based on electronic attraction now 
seem very promising. 
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Engineering and mapping nanocavity emission via 
precision placement of DNA origami 


Ashwin Gopinath!, Evan Miyazono?, Andrei Faraon? & Paul W. K. Rothemund!?4 


Many hybrid devices integrate functional molecular or 
nanoparticle components with microstructures, as exemplified 
by the nanophotonic devices that couple emitters to optical 
resonators! for potential use in single-molecule detection”’, 
precision magnetometry’*, low threshold lasing®® and quantum 
information processing’ '”. These systems also illustrate a common 
difficulty for hybrid devices: although many proof-of-principle 
devices exist, practical applications face the challenge of how 
to incorporate large numbers of chemically diverse functional 
components into microfabricated resonators at precise locations. 
Here we show that the directed self-assembly'** of DNA origami’® 
onto lithographically patterned binding sites allows reliable and 
controllable coupling of molecular emitters to photonic crystal 
cavities (PCCs). The precision of this method is sufficient to enable 
us to visualize the local density of states within PCCs by simple 
wide-field microscopy and to resolve the antinodes of the cavity 
mode at a resolution of about one-tenth of a wavelength. By simply 
changing the number of binding sites, we program the delivery of 
up to seven DNA origami onto distinct antinodes within a single 
cavity and thereby digitally vary the intensity of the cavity emission. 
To demonstrate the scalability of our technique, we fabricate 
65,536 independently programmed PCCs on a single chip. These 
features, in combination with the widely used modularity of DNA 
origami!*°, suggest that our method is well suited for the rapid 
prototyping of a broad array of hybrid nanophotonic devices. 

Solution-synthesized molecules and colloidal nanoparticles offer 
unique optical properties, such as tunable photoluminescence (PL) 
spectra, which are difficult to achieve in the materials we can most 
easily craft into microfabricated optical cavities. Thus hybrid nano- 
photonic systems, in which the strongly localized optical fields within 
microfabricated cavities further enhance the optical properties of 
molecules and nanoparticles! or create new properties such as lasing®® 
or nonlinear phenomena’, are of interest. 

To reliably build such systems, it is necessary to have a fabrication 
method which can both introduce well defined numbers (often 
exactly one) of optically active components into a single cavity, and 
position those components relative to the antinodes of the cavity 
mode with a precision that is well below the wavelength of light (tens 
of nanometres). Further, advanced hybrid devices would organize 
two or more kinds of chemically diverse subcomponents—organic 
molecules, nitrogen—vacancy centres, quantum dots, ions, or metallic 
nanoparticles—in intimate contact at the length scale of the subcom- 
ponents (0.1-10nm). Finally, whether the hybrid device is imagined 
to be part of a hand-held biomedical diagnostic machine, or part of a 
quantum circuit embedded within a classical computer which controls 
it, large numbers of devices must be scalably integrated on chips, using 
CMOS-compatible techniques. 

A complete solution for hybrid device fabrication would address all 
of the above challenges, achieving control of component number within 


a cavity, precise positioning in a cavity mode, modular incorporation of 
diverse materials, finer subcomponent-scale arrangement, and large- 
scale integration. No current technique suffices. Three approaches are 
used to integrate heterogeneous components into microstructures: 
(1) fully stochastic assembly, (2) fully deterministic scanning-probe 
assembly, and (3) directed self-assembly. In (1), fully stochastic 
assembly*°, molecules or nanoparticles are either deposited or grown 
at random locations on a substrate. When resonator locations are 
predefined’, the yield of functional devices in which a single component 
has landed within a resonator is limited by Poisson statistics to 37%, but 
yields can be lower since positioning within the cavity is uncontrolled. 
Alternatively, via ‘select and post-process’!, randomly arranged 
components can be mapped using atomic force microscopy (AFM), 
scanning electron microscopy (SEM), or super-resolution fluorescence 
microscopy and a resonator can be built around the component”. 
Scanning probe assembly (2) involves pushing components into a 
resonator using the scanning probe tip, one at a time!*"!. Neither fully 
stochastic nor scanning-probe approaches can be scaled up. In contrast, 
directed self-assembly (3) uses lithographically defined growth sites 
or binding sites to localize components to microstructures with high 
probability, and is thus inherently scalable. But while previous directed 
self-assembly techniques control emitter number and position’’, they 
do not generalize to other component types, or address subcomponent 
organization. 

Structural DNA nanotechnology”! can create organic/inorganic 
hybrid components at a resolution unachievable by top-down 
fabrication. Particularly versatile in this regard is DNA origami!>”’, in 
which a long single strand of DNA is folded by the concerted action of 
hundreds of much shorter DNA ‘staple strands’: coupling diverse sub- 
components to the staple strands yields modular DNA ‘breadboards’ 
that carry up to 200 different elements (these can range from organic 
dyes'®!7, metal nanoparticles!®!”, nitrogen-vacancy centres”* and 
quantum dots!”-!? to carbon nanotubes” and proteins), simultaneously 
juxtaposed'*!? with 3-5 nm resolution. Large-scale integration of the 
functionalized DNA shapes can then be achieved through electrostatic 
immobilization onto ~100-nm binding sites that have been lithograph- 
ically defined on desired substrates. When optimized, this ‘placement’ 
technique’*"* successfully positions single origami at 94% of bind- 
ing sites with ~20-nm precision. Together, origami and placement 
thus meet all five of the above challenges in the context of a general 
hybrid nanophotonics platform: the former provides modularity and 
fine subcomponent-scale organization, the latter control over compo- 
nent number, positioning and scalability. Below we give examples in 
which origami and placement work together to control the emission 
of thousands of silicon nitride (SiN) PCCs. 

Solution-synthesized DNA origami triangles (Fig. la and Extended 
Data Fig. 1) serve both as components and adaptors, carrying either 
3 or 15 cyanine dye subcomponents (Cy5, Fig. 1b) to triangular binding 
sites within PCCs. To form the origami, a long circular scaffold strand 
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Figure 1 | DNA origami, binding sites for placement, and photonic 
crystal cavities. a, A single-stranded M13mp18 scaffold is annealed with 
staples to form origami triangles with single-stranded linkers (black 
squiggles) for the attachment of subcomponents. b, Fluid AFM of origami 
without (leftmost image) and with (rightmost image) subcomponent 
strands (middle, grey squiggles) labelled with Cy5 (red oval or red dots) 
bound to linkers (black squiggles). The inset spectrum shows Cy5’s 

broad emission, centred at 670 nm). c, Fabrication of a single binding 

site (red), within a passivation layer (blue), followed by construction of a 
photonic crystal cavity (PCC) around it (details in Extended Data Fig. 2). 


(the genome of bacteriophage M13mp18) is annealed (90-20°C, 6h) 
with ~200 short computer-designed staple strands, and a Cy5-labelled 
subcomponent strand (grey strands with a red dot) which binds to 
linkers (black) projecting from a subset of the staple strands. Figure 1c 
outlines the fabrication steps (details in Extended Data Fig. 2) used 
to create both binding sites and PCCs from a 275-nm-thick SiN 
layer (purple) on a Si wafer (grey). Binding sites are created (steps 1 
and 2) via electron beam (e-beam) patterning of negatively charged 
carboxylate groups (red, which bind negatively charged origami 
strongly via positively charged Mg’* ions from solution) within a 
background of hydrophobic methy] groups (blue, which bind origami 
poorly). Directed self-assembly proceeds by incubating a solution 
of purified DNA origami over the patterned substrate which results 
in a high yield of single origami binding events. Figure 1d depicts 
placement reoptimized for SiN, which achieves 98% single-origami 
occupancy. PCCs based on previous designs” are fabricated after the 
binding sites via additional e-beam and etching (steps 3-6). Each cavity 
(Fig. le) is simply a row of three missing holes within a two-dimensional 
(2D) triangular lattice of air holes. In 2D, light is confined to the cavity 
because the spacing of the air holes creates an optical bandgap. Vertical 
confinement is achieved by total internal reflection at the interface of 
the SiN membrane and the air above and below. Cavity geometry was 
optimized using finite-difference time domain (FDTD) simulations to 
set up a high-quality mode (Fig. 1f) between 655nm and 660nm, near 
the emission peak of Cy5. 

A large variation in cavity emission, as a function of Cy5-origami 
position, highlights the importance of emitter placement in both theory 
and experiment (Fig. 2, Extended Data Fig. 4). Because emitters are 
located on the 2D cavity surface, total spontaneous emission Sy; from 
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d, Diagram shows test substrate for placement fidelity (without PCCs), 
and the groups which mediate binding (carboxylate), and non-binding 
(methyl). AFM images show binding sites before (left) and after (right) 
placement; 24 of 25 sites have a single origami (full test substrate: 600 sites, 
98% single origami). e, SEM of a PCC; a= 256 nm, r/a=0.3, r)/a=0.2, 
12/a = 0.25, s=0.22a. f, Low resolution PCC reflectance spectra (black) is 
compared to FDTD prediction (red); fundamental mode w), agrees within 
1nm, lower Q is attributed to fabrication defects”*. Inset, high resolution 
fit to w, (blue). Scale bars: b, 50 nm; d, 400 nm; e, 500 nm. 


a Cy5 at location r= (x, y) for wavelength w has three components!!: 
direct emission into free space, indirect emission into the cavity 
(eventually scattered into free space by imperfect confinement), and 
interference between the two. The measurement set-up (Extended Data 
Fig. 5) sets coefficients Cir, Ccay and Cin; which govern the relative 
contribution of each component to the total spontaneous emission: 


Sr(r, w) = Cair + CawP(r, Ww) + Cinl (1, w) 


where the enhancement of emission into the cavity mode, P(r, w), has 
a Lorentzian line shape centred about the cavity resonance w,, and an 
interference factor I(r, w) accounts for the interference between direct 
and indirect emission at the collection point. Because photons are 
coupled multimodally through free space into the monochromator, 
and because Cy5-origami carry multiple emitters, we neglect Cint 
and assume that any observed dependence on emitter position is due 
to P(r, w). In particular, P(r, w,), the Purcell factor, is proportional 
to the local density of states (LDOS) at r, which exhibits strong 
periodic features with a spacing below the diffraction limit (116 nm 
from central maxima to adjacent minima; see simulations in Figs 2c 
and 3b). To measure these features experimentally, PL spectra 
(Fig. 2b) were recorded for a series of isolated cavities each with a 
Cy5-origami positioned at one of 21 different x-offsets along their 
horizontal midlines (Extended Data Fig. 4a). Oscillations were less 
sharp than expected, but predicted antinodes were still prominent 
and emission from single Cy5-origami varied up to fourfold (Fig. 2c, 
data are Lorentzian fits to spectra). For comparison, we modelled 
random placement by creating PCCs filled with tightly packed origami 
(Extended Data Fig. 4b). Reference patches with the same average 
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Figure 2 | PL spectra as a function of emitter position. a, Dry AFM 

of PCCs with a single origami (n = 15 Cy5/origami) at different 

x-offsets; dots marks cavity centres. Scale bars, 250 nm. b, PL spectra from 
PCCs with x-offsets from —450 nm to +450 nm, with Ax of 45 nm. 

c, Comparison of background-subtracted and normalized PL peak 


number (26) of Cy5-origami were constructed on open SiN. After 
measuring spectra (Extended Data Fig. 4c) for five copies of each, we 
computed the ratio of enhanced to unenhanced emission; randomly 
placed emitters experience only a 1.5 enhancement, on average. 
Thus, placement of emitters is important to realize both maximal and 
reproducible emitter-cavity coupling. 

The ability to spectrally characterize single cavities (Fig. 2b) is 
important, but for mode mapping (Fig. 2c) this approach is tedious 
and sensitive to alignment errors. A variety of scanning probe!®!!?>-?7 
and cathodoluminescence techniques”* have been used to map 
microcavity modes in 1D!°’! and 2D**°8, but they require complex 
instrumentation. By fabricating a 2D array of PCCs whose microscale 
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emission from spectra in b with the normalized LDOS calculated 
via FDTD. Intensity error bars are +1 standard deviation for 6-8 
measurements; x-offset error bars are +1 standard deviation for 2-3 
post-fabrication AFM measurements, averaging +28 nm. 


X-Y coordinates reflect the nanoscale x-y coordinates of a Cy5-origami 
within each cavity (Fig. 3a), wide-field epifluorescence microscopy 
(Fig. 3c) affords direct super-resolution visualization of the 2D LDOS 
(Fig. 3b). Although the predicted subdiffraction mode pattern is clearly 
visible in an image of a single array (Extended Data Fig. 6a), Fig. 3c 
shows an average of five arrays with better signal-to-noise. A 1D slice 
through the map (Fig. 3d) matches the simulated LDOS better than data 
in Fig. 2c (perhaps due to better e-beam alignment for close-packed 
PCC arrays), and emission varies by >5-fold. Resolution is limited 
by the spatial arrangement of emitters on the origami, the precision 
of placement, and the resolution with which e-beam lithography 
can maintain registration between the micrometre-scale PCCs. 


Experiment 


as... 


Figure 3 | Visualization of the LDOS via wide-field microscopy. 

a, Top, the x-y location of a Cy5-origami (left) within a single PCC 
(centre) translates to that PCC’s X-Y location within a 40 x 15 PCC array 
(right; 176 jum x 771m). Dashed rectangle in left image shows the area 
over which origami position was varied, in increments of Ax =26.5nm 
and Ay=22.5nm. Bottom, AFM of four cavities; outline colours indicate 


positions in array shown at top right. b, Simulated LDOS for a single PCC. 
c, Wide-field epifluorescence microscopy average of five copies of the 
PCC array. d, Comparison of b to c along their horizontal midlines; data 
(black) normalized to simulated LDOS (red) at central antinode. Scale 
bars: b, 200 nm; c, 32 um. 
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Figure 4 | Digital control of cavity emission. a, AFM and schematics 
show that up to seven origami can be placed in distinct antinodes of a 
single PCC. b, Epifluorescence images show test patterns of each of the 
seven configurations from a plus an array of zero-site controls for n= 3 
and n= 15 Cy5/origami. 240 copies of each configuration are shown, 


Here emitters lie equally spaced along the ~50 nm (<./10) inner 
edge of origami triangles and so emitter arrangement dominates other 
factors, but designs with more densely arranged emitters should allow 
the method to be tested down to 10-nm resolution. 

We have shown that origami placement in PCCs enables super- 
resolution mapping of cavity modes, but the method’s implications 
are much broader. Our expanded-scale mode map (Fig. 3c) relies on 
the reproducible fabrication of 3,000 devices with precise emitter- 
cavity coupling, and its success implies the ability to create devices 
with arbitrary coupling, up to the maximum available within a single 
antinode. To show that we could further engineer emission through 
coupling to multiple antinodes, we created cavities for which a variable 
number of origami components, from one to seven, were positioned 
within the seven strongest antinodes of the cavity (Fig. 4a). Device 
reproducibility was assessed by creating 64 x 64 arrays of cavities, each 
with eight 512-cavity sub-arrays having the same number (from 0 to 7) 
of origami positioned within them. To demonstrate subcomponent- 
scale organization, we repeated the experiment for two different 
numbers and sublithographic arrangements of dyes (Fig. 4b, n = 3 Cy5 
and n= 15 Cy5). Predicted antinode variability was small (~15% peak 
intensity difference between the central and six surrounding antinodes) 
and so a roughly linear relationship between emission and origami 
number was expected and observed. This allowed us to use sixteen 
64 x 64 PCC arrays on the same chip to recognizably approximate a 
65,536-pixel grayscale image (Extended Data Fig. 9) with eight intensity 
levels (Fig. 4c). 

More quantitatively, single-origami devices in Fig. 4b had emission 
variabilities of 22.6% (that is, +1 coefficient of variation, which is the 
standard deviation divided by the mean) for 3 Cy5 and +10.9% for 
15 Cy5. We modelled device emission as having a binomially 
distributed component, governed by the number n of Cy5 and the 
fraction p of functional Cy5 (simultaneously capturing the purity of 
Cy5-labelled strands, coupling efficiency of these strands to origami, 
and fraction of unbleached Cy5) multiplied by a Gaussian component, 
simultaneously capturing both placement and fabrication errors, which 
should be independent of n. Minimizing squared error between 
modelled and observed device variability for n = 3 and n= 15 simulta- 
neously suggests that p = 87.2% of emitters were coupled and intact, 
and Gaussian noise was 4.5% (1 standard deviation). However, the 
effect of Gaussian noise on total variability was small (<10%): assuming 
a placement and fabrication error of 0%, and solving for p in 
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see Extended Data Fig. 7 for all 512 copies studied. Plots show means from 
Gaussian fits to intensity histograms (Extended Data Fig. 8); error bars are 
+1 standard deviation. c, Van Gogh’s The Starry Night approximated with 
65,536 cavities each having from zero to seven binding sites. See Extended 
Data Fig. 9 for details. 


o= J np(1— p) /np, where np(1 — p) is the variance and np the mean 


of the binomial distribution, gives similar values for both 3 emitters 
(86.7%) and 15 emitters (85%). We did not explore n = 1 because of 
bleaching and instrument sensitivity, but this analysis suggests that with 
longer-lived emitters, 85% yield for single-photon light sources could 
be easily achieved. 

Here we have used DNA origami as modular adaptors to quickly 
switch between two different tightly distributed numbers of emitting 
subcomponents (achieving 2.6 + 0.6 for n=3 and 13+ 1.4 forn=15; 
+ 1 standard deviation) simply by changing a few DNA strands. 
Because any material that can be attached to DNA can now be cou- 
pled to resonators, diverse applications can be explored. For label- 
free (emitter-less) single-molecule detection”, origami modularity 
will allow specificity to be switched quickly between small molecules, 
proteins and nucleic acids. For applications in quantum information 
or magnetometry which require emitters with better photophysics, 
recent advances in nanodiamond nitrogen” and silicon*” vacancy 
centres may provide suitable emitters. Even without emitter improve- 
ments, hybrid nanophotonics based on origami placement will enable 
the cavity amplification of many phenomena, ranging from plasmonic 
enhancement’® and lifetime engineering’? (where origami provide 
metal nanoparticle and emitter integration) to superradiance and 
low-threshold lasing*® (where origami can tightly control emitter 
number and density). Beyond hybrid nanophotonics, our work may 
apply to hybrid nanoelectronics”, and to any heterogeneous fabrica- 
tion wherein molecular or nanoparticulate components are integrated 
with microstructures—the end of low-yield methods such as ‘select and 
post-process’ is in sight. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


DNA origami 

Design. A variation of the ‘sharp triangle’ design described previously’ 
was chosen because the sharp triangle is rigid and it has a low tendency 
to aggregate. The ‘sameside sharp triangle’ used here (details of design in 
Extended Data Fig. 1) has the same scaffold path as the original sharp triangle, 
and the scaffold strand shares the same alignment as in the original structure. 
(For sequences and the caDNAno design, unzip Supplementary Data: specifically, 
files ‘ETSamSide_15Cy5_StapleList.xls’ and ‘Sameside-sharp-triangle-bridged.jsor’ 
which unzip into the directory ‘OrigamiDesignFiles.) The difference between the 
original sharp triangle and the sameside sharp triangle lies in the pattern of nicks 
along the phosphate backbone of the staple strands: in the original sharp triangle, 
nick positions alternate between the two faces of the triangle, whereas the new 
sameside sharp triangle has all nick positions on the same face of the triangle. 
Additionally, 15 staples on the inner edge of the triangle occur in two versions. 
The first version is an unmodified staple. The second version has an 18-nucleotide 
poly-T extension, which serves as a linker that binds to a 21-nucleotide poly-A 
‘subcomponent strand bearing a single Cy5 on its 5’ end. Using this basic design 
we prepared two version of the origami: one had n= 15 Cy5 molecules along its 
inner edge and the second had only n= 3 Cy5 in along its inner edge. 

Synthesis. Staple strands (Integrated DNA Technologies, 100M each in water) and 
the scaffold strand (single-stranded M13mp18, Bayou Biolabs, P-107) were mixed 
together to target concentrations of 100 nM (each staple) and 40 nM, respectively 
(a 2.5:1 staple:scaffold ratio) in 10 mM Tris Base, 1 mM EDTA buffer (adjusted 
to pH 8.35 with HCI) with 12.5mM magnesium chloride. (We refer to the buffer 
used for origami synthesis, with the above combination of Tris Base, EDTA, and 
Mg**, as ‘TE/Mg**”) Staples with the poly-T linker and the Cy5 label were added 
at 5x and 150x excess respectively. 50,11 volumes of staple/scaffold mixture were 
heated to 90°C for 5 min and annealed from 90°C to 20°C at —0.2°C min! ina 
PCR machine. We used 0.5 ml DNA LoBind tubes (Eppendorf) to minimize loss 
of origami to the sides of the tubes. 

Purification. Since a high concentration of excess staples interferes with origami 
placement, the synthesized origami were purified away from excess staples using 
100kD molecular weight cut-off spin filters (Amicon Ultra-0.5 Centrifugal Filter 
Units with Ultracel-100 membranes’). By the following protocol, recovery is 
generally 40%-50% and staples are not visible by agarose gel. Wet the filter by 
adding 500,11 TE/Mg?". Spin filter at 2,000 r.c.f. (relative centrifugal force) for 
6 min at 4°C, until the volume in the filter is 50,11. Discard the filtrate. Add 5011 
of unpurified origami and 40011 TE/Mg”". Spin at 2,000 r.c.f. for 6min at 4°C. 
Wash the origami three more times by discarding the filtrate, adding 450 jl 
TE/Mg”* and spinning at 2,000 r.c.f. for 6 min at 4°C. Invert the filter onto a clean 
tube and spin at 2,000 r.c.f. for 6 min at 4°C to collect purified origami (~50 11). 
Total time for this purification is roughly 40 min. 

All of the work reported in this Letter was performed with spin-column purified 
origami. Other protocols, which may give higher yield via PEG precipitation or 
magnetic bead separation, are available*!? although we have not verified that these 
methods would give quantitatively similar results for placement. 

Quantification. Post-purification, origami were quantified using a NanoDrop 
spectrophotometer (Thermo Scientific). We estimated the molar extinction 
coefficient of the origami triangles as that of a fully double-stranded M13mp18 
molecule (¢ = 123,735,380 M~! cm~1; we did not correct for a small single-stranded 
loop present on one edge of the origami). The typical working concentration for 
origami during placement was 100 pM, which is too small to be measured with 
the NanoDrop, so serial dilutions were performed. For all of the experiments 
performed reported here, we used a single 30011 stock of 20 nM origami solution 
that was diluted to 100 pM immediately before each use. 

Handling origami. After purification and quantification, it is especially important to 
use DNA LoBind tubes (Eppendorf) for storage and dilution of low concentration 
DNA origami solutions. Low dilutions, for example, 100 pM, must be made fresh 
from more concentrated solutions and used immediately—even overnight storage 
can result in total loss of origami to the sides of the tube. Addition of large amounts 
of carrier DNA to prevent origami loss may prevent origami placement, just as 
excess staples do. We have not yet determined whether other blocking agents such 
as BSA might both prevent origami loss and preserve placement. 

Photonic crystal cavity nanofabrication. We fabricated PCCs similar to previous 
designs” using two slightly different approaches. For isolated PCCs occurring in 
widely spaced arrays, like those which appear in Fig. 2, we suspended the PCC 
membranes using a front etch. For large, close-packed arrays of PCCs like those 
which appear in Figs 3 and 4, we suspended the PCC membranes using a back etch. 
Schematics for both fabrication processes are given in Extended Data Fig. 2. All 
fabrication was carried out in Caltech’s Kavli Nanoscience Institute. 

Widely spaced PCC arrays. A schematic of the fabrication process is shown in 
Extended Data Fig. 2a; SEM images of these arrays are given in Extended Data 


Fig. 3. Fabrication began with 275 nm LPCVD (low-pressure chemical vapour 
deposition) growth of SiN on a 100mm single-side polished (SSP, 525 + 251m 
thick) (100) silicon wafer as the base substrate (Rogue Valley Microdevices). The 
wafer was cleaned and alignment markers were defined in the SiN layer by e-beam 
lithography and modified Bosch ICP (inductively coupled plasma) etching. The 
substrate was cleaned again and silanized with a trimethylsilyl passivation layer 
by vapour deposition of HMDS (hexamethyldisilazane). Next, binding sites in the 
shape of a DNA origami were defined at specific locations using the alignment 
markers (defined a priori) by e-beam lithography. Binding sites were then acti- 
vated with a short O, plasma etch and the resist was stripped. Surface silanols on 
binding sites were converted to carboxylate groups via a second silanization. Lastly, 
a PCC was defined around the carboxylated binding site by e-beam lithography 
and modified Bosch ICP etching of the SiN layer. Finally, PCCs were suspended 
using a XeF; isotropic etch of the underlying Si layer. 

Here we note that all modified Bosch ICP etching of SiN, whether for definition 
of PCCs or alignment marks, was performed in an ICP-RIE (inductively coupled 
plasma-reactive ion etcher, Oxford Instruments System 100 ICP 380) at a pressure 
of 4 mtorr, with a flow rate of 25 sccm for SF¢ and 35 sccm for C4F, an RF generator 
power of 25 W, an ICP power of 1,000 W, at 4°C. Further, we note that this modified 
Bosch process does not involve alternating applications of SFs and C4F, but rather 
introduces them as a mixed gas. 

Definition of alignment markers. Substrates were sonicated with isopropanol (IPA) and 
dried in a stream of N> to remove any particulate contaminants. Substrates were spin 
coated with 600 nm of ZEP 520A (10s, 500 rpm; 10s, 1,500 rpm; 120s, 3,000 rpm; 
ZEP 520A is a high-resolution positive electron beam resist from Zeon Chemicals). 
Alignment marker patterns were defined by e-beam. Patterns were developed in 
ZED N50 (Zeon Chemicals electron beam resist developer) for 1 min. Patterns were 
transferred into the substrate using the modified Bosch process for about 30 min. 
Remaining resist was stripped with hot (50 °C) n-methyl pyrrolidone (NMP). 
Surface passivation with trimethylsilyl groups. Substrates were sonicated with IPA 
and dried in a stream of N>, to remove any particulate contaminants. Substrates 
were cleaned with O, plasma in a Plasmatherm Dual-chamber SLR-720 RIE 
(reactive ion etcher), with a flow rate of 50 sccm, a pressure of 50 mtorr and a 
power setting of 80 W (110 W cm~’), for 5 min to create surface silanols. Substrates 
were dehydrated on a hot plate at 150°C for 5 min. Substrates were incubated in a 
41 chamber saturated with HMDS vapour (a 20 cm x 20cm x 10cm Tupperware 
with a 90mm Petri dish in the corner holding 10 ml HMDS) for 20 min. This 
step resulted in the formation of a monolayer of trimethylsilyl groups on the 
surface. Substrates were heated on a hot plate at 150°C for 30 min, to stabilize 
the silanized surface. It is important to bake the substrate both before and after 
HMDS vapour-priming to ensure high quality surface silanization and increase 
the robustness of the monolayer to hydrolysis. Failure to perform either of these 
steps leads to increased background binding of origami. 

Definition of carboxylated binding sites. Binding sites were defined by e-beam 
lithography on a freshly prepared and silanized substrates as follows. Substrates 
were spin-coated with a 170-180 nm resist layer of poly(methyl methacrylate) 
(MicroChem Corp., 950 PMMA, A3) at 2,500 rpm for 90s. The resist was baked 
at 180°C for 30s. Binding sites were defined in the resist using e-beam lithog- 
raphy with a 100 keV beam at 500 pA current. The dosage ranged from 600 to 
850|1Ccm ” and patterns were proximity corrected. The resist was developed for 
70s ina 1:3 solution of methyl isobutyl ketone (MIBK) and isopropanol (IPA). 
Surface silanols were created in the lithographically defined regions by an O, plasma 
etch (Plasmatherm Dual chamber SLR-720 RIE, a flow rate of 20 sccm, a pressure of 
20 mtorr and a power setting of 50 W (92 Wcm ”), for 18s). The resist was stripped 
by sonicating the wafer in bath of NMP at 50°C for 10 min. The surface silanols 
that were created by O2 plasma was converted to carboxylate groups by incubating 
the substrate in a disodium carboxyethylsilanetriol (CTES from Gelest, 25% w/v in 
water Catalog #SIC2263.0) silanization buffer (0.01% CTES, 10 mM Tris, pH 8.3 for 
10 min) followed by sonication in milliQ water. During the last step, ensure that the 
silanization buffer is at pH 8.3. Higher pH leads to the trimethylsilyl background 
being hydrolysed and inadvertently carboxylated. 

Definition of isolated PCCs. After binding site definition, the substrate was cleaned 
with IPA. A 600 nm thick layer of ZEP 520A was spin-coated on the substrate 
and baked at 180°C for 1 min. PCCs were defined using e-beam lithography 
with a 100 keV beam at a current of 500 pA. The dosage ranged from 200 to 
300\1C cm? and patterns were proximity corrected. The exposed resist was 
developed in ZED N50 for 1 min. PCC structures were created via modified 
Bosch etch for 9-12 min, depending on the etch rate observed that day. The 
Si under the PCC was isotropically etched with a custom-built pulsed XeF2 
etcher (5 cycles of 20s each, 500 mtorr XeF2). The resist was stripped by overnight 
soaking of the chip in a bath of NMP at 50°C. It is important not to sonicate the 
substrate after the PCC has been suspended as it leads to the collapse of the PCC 
membrane. 
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Close-packed PCC arrays. A schematic of the fabrication process is given in 
Extended Data Fig. 2b. The process is a modification of procedures in ref. 33 in 
which extremely large SiN membranes were created (up to 2.4mm x 2.4mm). 
The fabrication process is similar to that described for isolated cavities above, with 
the exception that PCC arrays were created on a SiN window supported by a thin 
Si layer (which is removed at the very last step with an XeF, etch). Fabrication 
began with double-side polished silicon wafers (DSP, (100), 380 + 10nm thick, 
University Wafers, Rogue Valley Microdevices) with 275 nm layers of LPCVD- 
grown SiN on both sides of each wafer. 

Etching of large SiN windows. Wafers were cleaned and alignment marker were 
defined by e-beam lithography and ICP/modified Bosch etch (30 min) in the SiN 
layers on both sides, using the wafer flat for alignment. Substrates were spin-coated 
with a 600 nm layer of ZEP 520A on the back side of the wafer. Substrates were 
spin-coated with ProTEK PSB (Brewer Science) on the top side of the wafer to 
protect it from contamination and physical scratches. A 650|1m x 650\1m window 
was defined via e-beam using back-side alignment markers. The resist was devel- 
oped in ZED N50 for 1 min. The SiN layer (on the back side) was etched for 1h 
by the modified Bosch process, to punch through to the Si layer. Si exposed in the 
previous step was etched for 12h in 50% KOH solution to a depth of approximately 
300-350 jum. This created an SiN window of approximately 400 jum x 400 pm 
supported by an Si layer less than 80}1m thick. The ProTEK PSB was stripped 
using hot (50°C) NMP. 

Notes on fabrication. After Si-supported SiN windows were etched, origami binding 
sites and PCC arrays were defined on the top surfaces using earlier-defined top- 
side alignment. We note that the alignment error between patterns on the top and 
bottom of the wafers sometimes reached a few tens of micrometres. However the 
most crucial alignment errors, between the origami binding site and the PCC 
array, were limited to tens of nanometres, because both binding sites and PCCs 
were created using the same set of alignment markers on the top of the wafer. 
Finally, the thin Si support remaining underneath the PCC array was removed 
using XeF) pulsed etching. The suspended membranes are extremely delicate so 
it is crucial that substrate is handled with care. It is important not to sonicate 
suspended photonic crystal membranes as this will break them. 

Origami placement on PCCs. Here the placement DNA origami on binding sites 
is mediated by Mg”* binding to surface carboxylate groups, rather than surface 
silanols, as in our previous work", We have observed that the use of carboxylated 
binding sites allows high-quality origami placement and orientation on SiN at a 
much lower Mg”* concentration (15 mM) than that required (35 mM) for binding 
sites activated only by an O plasma (and which we presume are covered with 
silanols). We have not measured the density of carboxylate groups or silanols in 
either case, but we suggest that the effect is due to the difference in pK,s between 
these two functional groups: similar surface carboxylate groups™ have a pK, of ~6, 
while silanol groups have a pK, of 8.3. Thus binding sites with carboxylate groups 
should carry a higher negative charge at our working pH of 8.3, they should bind 
more Mg’t, and they should enable the observed binding of origami at lower 
Mg?* concentration. 

In addition to decreasing the potential for Mg’” salt artefacts during drying, 
the use of carboxylate groups has a further important added benefit. Over the 
course of the extensive PCC fabrication process, different areas experience many 
different specific series of treatments, which results in many different surface 
types. Some of these, for example the inside of the PCC holes or the back side of 
the PCC membranes, are not passivated with trimethylsilyl groups, and appear to 
bind some DNA origami at higher Mg” concentration. Thus the use of 
carboxylated binding sites (and hence a lower Mg”* concentration for placement) 
decreases nonspecific DNA origami binding and ensures that under our buffer 
conditions the only locations at which origami can stably bind are the intended 
binding sites. 

Below, we describe the placement protocol in five steps, which apart from the 
use of a lower Mg”* concentration and carboxylated binding sites, is similar to 
that in our previous work". Photographs in supplementary figure 3 of ref. 14 
show how substrates should look at different steps of the placement process. (Note 
that our previous work uses carboxylation of binding sites at a different step, after 
origami placement, as part of a method to achieve covalent coupling of origami 
to substrates.) 

Binding. A 50mm Petri dish was prepared with a moistened piece of laboratory 
tissue paper to limit evaporation. Solution with 100 pM origami was prepared 
in ‘placement buffer’ (10 mM Tris, 15 mM Mg?*, pH 8.3) and a 20,11 drop was 
deposited in the middle of the chip, on top of the PCCs. The chip was placed in 
the closed, humid Petri dish and the origami solution was allowed to incubate on 
the chip for 1h. 

Initial wash. After the 1h incubation, excess origami (in solution) were washed 
away with at least 8 buffer washes by pipetting 60 11 of fresh placement buffer onto 
the chip, and pipetting 60 1l off the chip. Each of the 8 washes consisted of pipetting 
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the 6011 volume up and down 2-3 times to mix the fresh buffer with existing buffer 
on the chip. This initial wash took about 2 min. 

Tween wash. Next, in order to remove origami that were non-specifically bound 
to the passivated background, the chip was buffer-washed 5 times using a “Tween 
washing buffer’ with 0.1% Tween 20 (v/v in placement buffer). This took about 
1 min. Because of the low surface tension of the Tween washing buffer, these washes 
were difficult to perform: they involve adding 20-40 jl of wash buffer, just enough 
to cover most of the chip, but not enough to spill over the chip and wet the back side 
of the chip (this may introduce dust contamination from the Petri dish). After the 
fifth wash, the chip was left to incubate for 30 min. (It is important to use Tween 
20, rather than other surfactants. Tween 80 and SDS, which are two other common 
surfactants, lead to markedly different results—Tween 80 leads to the total removal 
of placed origami from the substrate, and SDS does not remove excess origami 
from the trimethylsilyl background. It is important that chips are not exposed to 
Tween 20 until after the origami have been deposited. Tween 20 applied before 
binding completely inactivates the binding sites.) 

Final stabilizing wash. Last, the chip was buffer-washed 8 times back into a higher 
pH ‘stabilizing buffer’ (10 mM Tris, 15mM Mg’, pH 8.9) so that origami bound 
strongly and artefacts were minimized during subsequent drying. This took about 
2min. These washes were relatively high volume (6011) and were intended to 
completely remove the Tween 20. The amount of Tween 20 left was monitored 
qualitatively by the surface tension of the drop (roughly, by eye). When a 20,11 drop 
covered roughly the same area as the initially deposited drop, it was assumed that 
the Tween 20 had been sufficiently removed. In the last wash the chip was left with 
roughly 2011 of stabilizing buffer, and was ready for drying. 

Drying. Chips were dried by serial dilution into ethanol. The chips were dipped 
in 50% ethanol in water (v/v%) for 10s, 75% ethanol in water (v/v%) for 10s and 
then 120s in 90% ethanol in water. Then the chips were air-dried. (If nanoarrays 
are subjected to ethanol solutions with less than 80% ethanol for more than 60-90, 
origami begin to detach from the surface. After the 90% ethanol immersion, it is 
necessary to let the samples air-dry rather than using N2 or compressed air, since 
streaking or other drying artefacts are observed in instances of forced air drying.) 
Troubleshooting origami placement. If low binding, high multiple binding, or 
high background binding artefacts are observed, see the troubleshooting guide 
in Extended Data Table 1. Below we provide notes emphasizing some of the 
more important aspects of the placement protocol, which may be helpful in 
troubleshooting. Do not allow the patterned region with binding sites to dry at 
any point during the binding step or subsequent buffer washes. Inadvertent dewet- 
ting of the binding sites leads to distortion of the origami (causing them to ball 
up) as well as the formation of salt crystals on top of them. Do not use EDTA in 
placement, Tween washing, or imaging buffers. It is unnecessary in this context 
and will change the effective Mg”* concentration for placement slightly. Make 
fresh buffer solutions every week. Here and elsewhere in this work, we use buffers 
at low strength (typically 10 mM) to minimize background binding and to make 
complete washing into different buffers easier. This means the buffers have low 
buffering capacity and the pH will decrease (and placement may cease to work, 
depending on conditions). For example, weak buffers made to read pH 8.35 can 
lose 0.05 pH units and read pH 8.3 after one week. 

AFM characterization. After the cavities were fabricated and the origami were 
placed within the cavities, we quantified the quality of placement using dry AFM. 
While the presence of origami (and the fluorophores they carry) can be inferred 
from the optical experiments, we have used AFM because it provides the ability 
to unambiguously verify the presence of single origami, as well as measure the 
precise location of the origami (and hence fluorophores) with respect to the cavity. 
Unless otherwise specified (images in Fig. 1b were taken under aqueous buffer 
with a Bruker Fastscan AFM), all AFM images were taken in air in tapping mode 
with a Dimension Icon AFM/Nanoscope V Scanner (Bruker) using the short, fat 
cantilever from an SNL probe B (“sharp nitride lever’, 2 nm tip radius, Bruker) 
resonating at approximately 60 kHz. Imaging origami on suspended resonators 
was challenging. All images had to be acquired at the lowest possible force in 
order to reduce tip-sample interaction. We optimized imaging conditions on an 
unsuspended region near the alignment markers and then moved onto the cav- 
ity with precise stage movements. Minimization of imaging force was important: 
if imaging force was increased, imaging became unstable. We noticed that imaging 
near the edge of suspended regions was easier than at the middle; we suggest that 
the suspended SiN membrane is interfering with AFM measurement by acting as 
a mechanical resonator itself. 

Each AFM image was processed using Gwyddion (http://gwyddion.net/). In 
assessing the quality of placement, we measured both the binding site occupancy 
(the percentage of sites with one or more origami), and the number of origami 
at a given site (0, 1, 2, or >3). In previous work!*, we measured the quality of 
placement over dozens of repetitions of placement, where each repetition had more 
than 500 binding sites. Here, because of the difficulty of AFM measurements on 
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PCC resonators, we performed a test for the quality of placement by analysing a 
single array of 600 binding sites on SiN (Fig. 1d), without any PCCs. Carboxylate- 
mediated placement on SiN resulted in no empty binding sites, and few sites with 
multiple bindings (~2% of sites). Thus, we measured a single origami occupancy 
of 98%, consistent with that previously achieved. 

For experiments in which single origami binding sites were written inside PCCs 
(Figs 2 and 3), AFM imaging of selected cavities showed that the fabrication steps 
added to create the PCCs did not degrade the quality of placement: all PCCs 
inspected (Fig. 2a, Fig. 3a, and Extended Data Fig. 4) had binding sites occupied 
by single origami, within ~10 nm of the desired location. We attribute the low 
rate of multiple bindings to the low Mg”* concentration (enabled by carboxylated 
binding sites), the Tween 20 washes, and the proximity correction (which enables 
accurate patterning of the binding sites) that we used. We attribute the low posi- 
tional error to our use of multiple alignment markers within the single e-beam 
field, and the use of the same markers for the patterning of both the PCCs and the 
binding sites. As far as we can tell, the quality of the binding sites is not affected by 
their proximity to etched features, such as the holes in the PCCs. 

For experiments in which multiple origami binding sites were written inside 

resonators (Fig. 4, Extended Data Figs 7 and 9), we observed that resonators 
with larger numbers of binding sites (that is, 5, 6 or 7) had a greater number of 
multiple bindings (which increase the number of origami in the resonator above 
that desired). We did not quantify this effect, because linear fits of emission as a 
function of the number of binding sites suggest that it is not a large effect (Fig. 4b). 
In our previous work", we observed that crowding of binding sites over large arrays 
decreased site occupancy, because sites filled mostly via 2D diffusion of weakly 
surface-bound origami from the edges of the array. Here, crowding of binding 
sites within PCCs seems to have an opposite effect. The context is quite different, 
however. The PCC structure blocks 2D diffusion of origami to binding sites from 
the background, and sites are positioned closely in a different configuration which 
may allow extra origami to bind partially to two adjacent binding sites. The effect 
of the spacing between origami binding sites in different contexts thus deserves 
more study. 
Spectroscopy. For all cross-polarization reflectance and microphotolumines- 
cence (fluorescence) spectroscopy we used the set-up illustrated in Extended 
Data Fig. 5. For cross-polarization reflectance measurements, we used a Fianium 
supercontinuum laser as an excitation source. Broad-spectrum excitation light was 
polarized used a linear polarizer (polarizer 1) and focused onto the PCC sample 
through a 50x (0.8 NA) infinity-corrected objective. The sample was oriented 
at 45° to the incident polarization, and reflected signal collected by the objective 
passed through another polarizer (polarizer 2, orthogonal to polarizer 1) before 
being coupled into a monochromator (a SpectraPro-2500i with a grating of 1,800 
grooves per millimetre from Princeton Instruments). 

For fluorescence spectroscopy we use the same basic set-up but replaced the 
supercontinuum laser with a 15 mW, 638 nm laser, bandpass filtered at 635 + 5nm 
(635 BP). After being expanded to ~500 1m in size, the laser beam was focused 
onto the sample through the same 50x (0.8 NA) objective. On the sample surface, 
the laser spot was approximately 51m in size, as measured by a CCD camera. 
Fluorescence emission was filtered through a 650-nm longpass filter (650 LP) to 
remove excitation light, before being coupled into the monochromator. A marked 
reduction in fluorescence signal was observed after illumination times of 30-45. 
Thus the potential effects of bleaching were minimized by limiting total illumina- 
tion time to 10s: final spectra were produced by averaging 10 spectral scans, each 
taken with an integration time of 1s. 


Wide-field epifluorescence microscopy. All fluorescence imaging was performed 
with an Olympus BX-61 microscope with a xenon excitation source. Excitation 
light was filtered with a 640-nm shortpass filter, and passed through a 645-nm 
dichroic to the sample. After returning through the dichroic, emission light 
was bandpass filtered at 655 + 5nm. The narrow bandpass filter was important 
for emission; longpass filtering of the emission light admitted too much direct 
Cy5 emission, rather than indirect emission through the cavity, and this made it 
impossible to image the 2D mode map. Fluorescence emission was imaged using 
a 50x (0.8 NA) objective onto a Hamamatsu EMCCD cooled to —80°C. Complete 
bleaching of samples was observed at illumination times of approximately 2 min. 
For a particular PCC array, each final image was created by averaging 10 image 
acquisitions. For n = 3 Cy5, integration time for each of these acquisitions was 1 s; 
for n= 15 Cy5, integration time for each acquisition was 200 ms. In the case of 2D 
modemaps, a single final image of a particular PCC array (Extended Data Fig. 6a) 
was somewhat noisy, and five final images of different PCC arrays were averaged 
to yield the final data (Fig. 3c and Extended Data Fig. 6b). Features of the exper- 
imental images and 2D LDOS are qualitatively similar, but we did not attempt to 
fit experimental images to simulation, and instead matched their features by eye. 

FDTD simulations for PCC design and analysis. Three dimensional (3D) 
finite-difference time domain (FDTD) simulation was used both for PCC 
design and to generate simulated LDOS for comparison with experimen- 
tal maps of the resonant cavity modes. All simulations were performed using 
FDTD Solutions from Lumerical Solutions, Inc (https://www.lumerical.com/). 
Lumerical simulation files can be found by unzipping the Supplementary Data, 
in the directory ‘LumericalScripts. Matlab files for creating Autocad versions of 
optimized resonators can also be found by unzipping the Supplementary Data, in 
the directory ‘AutocadScriptGenerator. 

Cavity geometry (Fig. le) was based on previous designs” featuring soft 
confinement». To design the photonic crystal we fixed the refractive index of SiN 
at 2.05, the thickness of the SiN membrane at 275 nm, and adjusted r, r/a, rj, r. and s 
to maximize quality factor within the wavelength range 655-660 nm. Photonic crys- 
tal size was set to 20a in the x-direction and 34.64a in the y-direction. Boundary 
conditions were implemented by introducing a perfect matching layer around the 
structure. The simulation discretization was set to a/R in the x-direction, 0.866a/R 
in the y-direction, and a/R in the z-direction, where the variable R was set to 10 for 
PCC design (so that PCC parameters could be quickly optimized), and set to 20 to 
generated simulated LDOS of higher resolution for comparison with experimental 
mode maps. The simulation modelled emission from a single dipole with polariza- 
tion P(x, y, z)=(1, 1, 0), located at a weak symmetry point close the cavity surface. 
Code availability. The code used to design and simulate the PCCs as well as code 
to generate Autocad files for electron beam lithography defining PCCs and binding 
sites is available as the zip-encoded Supplementary Data file. 
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Extended Data Figure 1 | DNA origami design showing position of Cy5 
fluorophores. a, We used a triangular DNA origami shape, ~130nm on 
each edge, as an adaptor to position fluorophores within a photonic crystal 
cavity (PCC). The origami is composed of a 7,249-nucleotide (nt) single- 
stranded scaffold (the commercially available single-stranded genome 

of the M13mp18 bacteriophage) and approximately 200 commercially 
synthesized ‘staple strands’ which are typically 32 nucleotides in length. 
Mixed together and annealed from 90 to 20°C over the course of about 
6h, the strands self-assemble to form equilateral triangles in high yield. 
Three trapezoidal domains make up this ‘sameside sharp triangle’ design. 
Red filled circles indicate 15 positions at which staples were extended on 
their 5’ end with 18-nt poly-T linkers for the synthesis of origami bearing 
n= 15 CyS. Blue rings indicate the subset of positions used for origami 
bearing n =3 Cy5. To these linkers, 21-nt poly-A ‘subcomponent strands 
modified with a 5’ Cy5 were hybridized. Because the Cy5 modification is 
on their 5’ ends, subcomponent strands put Cy5 within a few nucleotides 
of the origami surface, less than 1 nm from where linkers extend from the 
origami surface (rather than at the 6-nm-away distal end of the linker/Cy5 
strand hybrid). b, Schema showing both components of the fine structure 


of a DNA origami (gaps between helices, and crossover positions, where 
helices are tangent) with linker/Cy5-label positions noted as in a. The 
white outer ring has a 62.4-nm diameter and the white inner ring has a 
31.2-nm diameter. As in a, red dots indicate the subset of positions used 
for origami bearing n = 15 Cy5 and blue rings around red dots indicate 
positions used for origami bearing n = 3 Cy5. Rainbow colours on the 
origami trace the path of the scaffold as it progresses through the origami 
structure, from red to purple. c, Buffer AFM image showing fine structure 
of origami without linkers/Cy5 strands for comparison with b. d, Buffer 
AFM image showing origami with n = 3 linkers and Cy5 strands. In this 
AFM image the Cy5-labelled duplexes appear to fall on the inside of the 
inner triangular hole, perhaps due to adhesion to the mica substrate used 
in these high-resolution experiments. High-resolution imaging of origami 
dried onto SiN is difficult, and so the exact conformation of 

Cy5-labelled duplexes on resonators was not determined. However, 
because the carboxylated sites bind DNA strongly, it seems likely that 
Cy5-labelled duplexes would behave similarly and fall within the 54-nm 
triangular hole of the origami, between 30 and 60 nm from the centre of 
the origami. 
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Extended Data Figure 2 | Process flow for fabricating PCCs. a, Fabrication of isolated PCCs for widely spaced arrays (Fig. 2). b, Fabrication of PCCs 
in close-packed arrays (Figs 3, 4). Note that back-side alignment markers are omitted from b for clarity. After either fabrication process substrates are 


incubated in origami solution, rinsed of excess origami, subjected to an ethanol dilution series, and air-dried. 
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Extended Data Figure 3 | SEM imaging of isolated PCCs used for taking Extended Data Fig. 4. Red squares indicate broken PCC membranes 


fluorescence spectra as a function of x-position. An 8 x 21 array was which were not used. b, Intact membrane. ¢, Typical broken membrane. 
used. a, Positions 01 to 21 each indicate a row of eight copies of isolated d, Zoom-in of PCC. e, Cross-section of PCC membrane. Scale bars: 
PCCs having origami positioned at the same x-offset, as exemplified in a, 300m; b, 241m; d, 1 jum; e, 500 nm. 
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Extended Data Figure 4 | AFM of isolated PCCs and control cavity. a, Schema and representative AFM for each of 21 offsets at which spectra (Fig. 2b) 


were measured. b, Dry AFM ofa control PCC filled with origami (n = 15 Cy5/origami). Scale bar, 250 nm. c, PL reference spectra (red) for Cy5-origami 
on open SiN, and for Cy5-origami filling a cavity, as in b. 
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Extended Data Figure 5 | Optical set-up for reflectance and fluorescence spectroscopy. The 650-nm long-pass filter (650 LP) was used only for 
fluorescence spectroscopy. Details in Methods. 
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Extended Data Figure 6 | Comparison of unaveraged and averaged epifluorescence images of PCC array used for 2D mode map. a, A single 
epifluorescence image of a single array. b, An average of five images taken of five different PCC arrays, as in Fig. 3c. Scale bars, 23 pm. 
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Extended Data Figure 7 | Raw fluorescence data demonstrating of the central panel; each origami has n= 3 Cy5. The right panel shows 
digital control of cavity emission. The central panel diagrams eight an analogous epifluorescence image of eight PCC arrays for which each 
different arrangements of origami (white triangles) within PCCs, with origami has n= 15 Cy5. Histograms summarizing these epifluorescence 
the number of origami ranging from zero to seven. The left panel shows data are given in Extended Data Fig. 8. See Fig. 4b for plots showing the 


an epifluorescence image of eight PCC arrays, each containing 512 copies linearity of intensity as the number of origami are increased. Scale bars, 
of a PCC with the origami arrangement diagrammed in the adjacent row 50 jum. 
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Binding Sites | 3-Cy5-Origami| 15-Cy5-Origami 
111.53 + 25.2 83.75 + 9.1 


1 
171.24+29.9 | 150.17 + 114.5 
3 231.36 + 34.6 | 213.55 +134 
4 313.92 445.1 | 314.68 +211 
Sa) 
7 


375.95 + 44.2 | 409.78 + 24.9 
448.11+51.1 | 507.87 + 35.2 
| 7s«s«2572.45 464.8 | 609.03 + 33.4 


Extended Data Figure 8 | Histograms of numerical data demonstrating 


digital control of cavity emission. Eight different arrangements of 
origami were loaded into PCCs, with the number of origami ranging 


from zero to seven (see centre panel, Extended Fig. 7). 512 identical PCCs 


of each of the nonzero arrangements were imaged by epifluorescence 


microscopy (Extended Data Fig. 7, left and right panels) and their intensity 


was quantified. These data, for n = 3 Cy5 origami (a) and n=15 Cy5 


origami (b) are represented by coloured histograms labelled ‘one’ through 
‘seven. The number of PCCs with a given fluorescence intensity is plotted 
on the y-axis. c, Summary of means and standard deviations 

(for intensity measured as counts) from Gaussian fits in a, b. The peaks of 


these histograms are plotted in Fig. 4b. 
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Extended Data Figure 9 | Schema for recreation of Vincent van Gogh’s extremely large suspended SiN membranes (which easily break) we limited 


painting The Starry Night (1889). See Fig. 4c for recreation. a, Original our recreation to a low-resolution 256 x 256 = 65,536 pixel version, which 
image from Wikimedia Commons (https://commons.wikimedia.org/wiki/ was then further broken down into sixteen 64 x 64 pixel arrays (d). With 
File: VanGogh-starry_night_ballancel.jpg). Because our current emitters one PCC per pixel, these arrays were small enough that we could fabricate 
and cavities provide only one color, with only eight levels of intensity, the and handle the resulting SiN membranes without too much breakage, 
original image was converted to greyscale (b) and then further converted image them separately, and then reassemble the sixteen images into a 4 x 4 
to have three-bit colour depth (c). Because of the difficulty of handling array for the final recreation (Fig. 4c). 
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Extended Data Table 1 | Troubleshooting guide for origami placement 


Tike cause 


Site occupancy below 90%. | * Old chip with inactive sites. 
e Low origami concentration. 
e Short incubation time. 
* Low Mg?t or pH, esp. 
if site occupancy < 30%. 
Primarily: 
¢ High origami concentration. 
e Long incubation time. 
¢ Oversized features. 


High multiple binding. 


Secondarily: 
e High pH. 
* High Mg?*. 


Poor alignment of origami 
with few multiple bindings. 


¢ High pH. 
° High Mg**. 


Problem with trimethylsilyl (TMS) 

background passivation. 

¢ Poor initial TMS quality. 

¢ TMS hydrolyzed by high pH. 

¢ TMS hydrolyzed by long 
incubation. 

¢ Failure to wash weakly 
bound origami from TMS. 


High background binding. 

¢ Whole or partial origami 
on background in AFM. 

e Unstable AFM, e.g. 
whole scanlines of 
identical value (“scars”). 

e For fluorescent origami, 
high background under 
optical imaging. 


Large particulates on sites 
but few or no origami. 


e Sample dewetted or dried. 
Salts and origami aggregates 
occupy the site. 

Small particles on ¢ Overbaked PMMA. 

background. 


¢ Too much time spent in 
dilute ethanol < 80%. 


Origami fall off during 
ethanol drying. 


Origami ball up into site 
during ethanol drying and 
corners are double height. 


¢ Origami project onto 
non-sticky TMS surface. 


¢ High pH of CTES solution 
has hydrolyzed TMS 
(before placement). 


CTES-mediated placement 
yields binding of origami 
everywhere. 


Origami placement!*4 is a robust technique for the delivery of components into microfabricated structures, but the quality of the results is nonlinear in a number of variables, including pH, origami 
concentration, Mg?* ion concentration, and incubation time. To aid reproduction of the technique, we give a guide to the effects that occur, and the remedies to apply, when experimental variables are 


outside the optimal range. 


¢ Acetate causes fine precipitate. 


¢ Chips work best < 24 hours after activation. 
e Use ~100 pM origami. 

Prepare dilution fresh. Use Lo-Bind tubes. 
¢ Incubate origami for an hour. 
° Use 15 mM Mgt 
e Use pH 8.3-8.5. 


First try: 
e Use ~100 pM origami. 
e Keep incubation between 30 and 90 min. 
¢ Look at features in resist by SEM and 
adjust e-beam write (feature size, dose) 
and/or minimize O, activation time. 
Second try: 
¢ Keep pH in the range 8.3-8.5. 
° Use 15 mM Mg*t. 


¢ Keep pH in the range 8.3-8.5. 
° Use 15 mM Mgt 


¢ Dehydrate the wafer by baking before and 
after TMS formation. 
¢ Keep pH< 9 preferably in the range 8.3-8.5. 
e Keep incubation between 30 and 90 minutes. 
e¢ Remove weakly bound origami with 
8x Tween 20 washes. 


¢ Do not let chip dewet during origami 
deposition or subsequent buffer washes. 


* Bake PMMA for 30 s at 180°C . 
e Use non-acetate salts/acids when preparing 
buffers, e.g. use MgClo, and HCI to adjust. 


¢ Move quickly from low to high % ethanol. 


¢ Hydrolyze TMS surface before drying 
by incubating in pH 8.9 stabilization buffer. 


e Use 0.01% CTES in a buffer adjusted to 
a pH of $8.3 for carboxylation step. 
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Cooperative electrocatalytic alcohol oxidation with 
electron-proton-transfer mediators 


Artavazd Badalyan! & Shannon S. Stahl! 


The electrochemical oxidation of alcohols is a major focus of energy 
and chemical conversion efforts, with potential applications ranging 
from fuel cells to biomass utilization and fine-chemical synthesis!’ 
Small-molecule electrocatalysts for processes of this type are 
promising targets for further development’, as demonstrated by 
recent advances in nickel catalysts for electrochemical production 
and oxidation of hydrogen? ''. Complexes with tethered amines 
that resemble the active site of hydrogenases!” have been shown 
both to catalyse hydrogen production (from protons and electrons) 
with rates far exceeding those of such enzymes!"!? and to mediate 
reversible electrocatalytic hydrogen production and oxidation with 
enzyme-like performance". Progress in electrocatalytic alcohol 
oxidation has been more modest. Nickel complexes similar to 
those used for hydrogen oxidation have been shown to mediate 
efficient electrochemical oxidation of benzyl alcohol, with a 
turnover frequency of 2.1 per second. These compounds exhibit 
poor reactivity with ethanol and methanol, however!>. Organic 
nitroxyls, such as TEMPO (2,2,6,6-tetramethyl-1-piperidine 
N-oxyl), are the most widely studied electrocatalysts for alcohol 
oxidation®-”!*!9, These catalysts exhibit good activity (1-2 
turnovers per second) with a wide range of alcohols!® and have 
great promise for electro-organic synthesis’. Their use in energy- 
conversion applications, however, is limited by the high electrode 
potentials required to generate the reactive oxoammonium species. 
Here we report (2,2’-bipyridine) Cu/nitroxyl co-catalyst systems for 
electrochemical alcohol oxidation that proceed with much faster 
rates, while operating at an electrode potential a half-volt lower 
than that used for the TEMPO-only process. The (2,2’-bipyridine) 
Cu(II) and TEMPO redox partners exhibit cooperative reactivity 
and exploit the low-potential, proton-coupled TEMPO/TEMPOH 
redox process rather than the high-potential TEMPO/TEMPO* 
process. The results show how electron-proton-transfer mediators, 
such as TEMPO, may be used in combination with first-row 
transition metals, such as copper, to achieve efficient two-electron 
electrochemical processes, thereby introducing a new concept for 
the development of non-precious-metal electrocatalysts. 

Cu/TEMPO and related catalyst systems have been studied exten- 
sively for aerobic alcohol oxidation’. An early report suggested that 
the Cu component of this co-catalyst system mediates aerobic oxida- 
tion of the hydroxylamine to the oxoammonium form of the nitroxyl 
co-catalyst (see Fig. 1c)*!. Later studies*’-4, however, highlighted the 
mechanistic relationship between these catalysts and galactose oxidase 
(Fig. 1d and e), an enzyme that features a tyrosine-derived phenoxyl 
radical coordinated to a Cu(II) centre in the active site?>”°. Both the 
chemical (Cu/TEMPO) and enzyme-based reactions mediate alcohol 
oxidation via the cooperative reactivity of Cu(II) and an oxyl radical 
as oxidants. This joint participation of two one-electron oxidants con- 
trasts the many alcohol oxidation methods in organic chemistry that 
use a two-electron oxidant, such as TEMPO?™ (see Fig. 1c), Cr(VI) or 
I(III) reagents, or noble-metal catalysts (such as Pd(II) and Ir(III)) that 
undergo facile two-electron redox steps. 


The unique Cu/TEMPO mechanism has important implications 
for electrocatalytic alcohol oxidation because, under appropriate 
conditions, it should be possible to generate TEMPO” at much lower 
electrode potentials than needed for TEMPO?. To test this hypothesis, 
preliminary studies probed the electrochemical behaviour of (bpy) 
Cu/TEMPO (where bpy is 2,2’-bipyridine) solutions under anaerobic 
conditions. A cyclic voltammogram, recorded with a solution of Cu(I) 
OTf (where OTf is trifluoromethanesulfonate), bpy and TEMPO 
(1 mM each) in acetonitrile, revealed (quasi)reversible features 
associated with the (bpy)Cu(II)/(bpy)Cu(I) and TEMPO™” redox 
processes at —0.18 V and +0.24V, respectively, versus ferrocenium/ 
ferrocene (Cp2Fe*’) (Fig. 2a). No TEMPO/TEMPOH redox feature 
was observed in the cyclic voltammogram, reflecting the lack of a pro- 
ton source in solution. Electrochemical oxidation of TEMPOH often 
requires a large overpotential’’, which could reduce the efficiency of 
TEMPO regeneration during catalytic turnover. Independent exper- 
iments, however, highlighted an alternative pathway for TEMPOH 
reoxidation: (bpy)Cu(II)(OT£), was found to mediate efficient oxi- 
dation of TEMPOH to TEMPO under anaerobic conditions in the 
presence of 2,6-lutidine as a Bronsted base (that is, as a proton acceptor). 
The relevance of this process to alcohol oxidation was demonstrated 
via spectrophotometric study of the anaerobic oxidation of benzyl 
alcohol by (bpy)Cu(II)(OTf)2 in the presence of 2,6-lutidine and 
catalytic TEMPO (0.1 equivalents). This mixture led to rapid reduction 
of Cu(ID), affording a solution that was spectroscopically identical to 
an independently prepared solution of Cu(I)OTf/bpy (1:1) (Fig. 2b). 
The reaction did not stop or slow after 10% conversion (that is, after 
complete consumption of TEMPO), and no benzyl alcohol oxidation 
or (bpy)Cu(II) reduction was observed in the absence of TEMPO. 
These observations indicated that (bpy)Cu(II) promoted the oxidation 
of both benzyl alcohol and TEMPOH, as shown in the reaction cycle 
in Fig. 2b, and suggested that electrochemical regeneration of (bpy) 
Cu(II) could provide the basis for electrocatalytic alcohol oxidation. 

Cyclic voltammetry studies in the absence of a Bronsted base 
showed that the electrochemical behaviour of (bpy)Cu(II)(OTf)2 was 
not affected by the presence of TEMPO and/or benzyl alcohol. A slight 
increase in the anodic peak current was observed for the (bpy)Cu(II)/ 
(bpy)Cu(I) wave at —0.14 V upon addition of 2,6-lutidine, presumably 
reflecting catalytic turnover of the Cu(II) (and TEMPO) species on 
the timescale of the cyclic voltammogram scan (see Supplementary 
Fig. 1). This result prompted us to evaluate other Bronsted bases, 
including N-methylimidazole (NMI), 4-(dimethylamino) pyri- 
dine, 2-(dimethylamino)pyridine, 1,4-diazabicyclo[2.2.2]octane, 
triethylamine, diisopropylamine, isopropylamine, tributylamine, 
dibutylamine, n-butylamine and aqueous NaOH (see Supplementary 
Figs 2 and 3). Significant catalytic features were evident in the cyclic 
voltammograms with several of these bases, and triethylamine (Et3N) 
was particularly effective (Fig. 3a). 

The reaction conditions were optimized to permit assessment of the 
catalytic performance of the (bpy)Cu/TEMPO/Et3N catalyst system, 
and the analysis was performed at a scan rate at which the catalytic 
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Figure 1 | Molecular electrocatalysts and related homogeneous and 
enzymatic catalysts. a, Synthetic Ni complex that catalyses H2 formation, 
with pendant amines that function as proton relays, and the analogous 
structure of the [FeFe] hydrogenase active site. b, The redox reactions of 


current (which is proportional to the reaction rate) does not exhibit 
a scan-rate dependence (150mVs7~', Supplementary Fig. 4). The 
catalytic current exhibited a saturation dependence on both [Et3N] 
and [TEMPO], and the concentrations of these components were 
chosen within the concentration-independent-rate regimes (50 mM 
and 5 mM, respectively; see Supplementary Figs 5 and 6). Under these 
conditions, the catalytic current exhibited a linear, first-order depend- 
ence on [(bpy)Cu] and a half-order dependence on [PhCH,0H] 
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Figure 2 | Cyclic voltammogram of a solution of (bpy)Cu(OTf) and 
TEMPO and spectrophotometric evidence for (bpy)Cu(II)-mediated 
oxidation of TEMPOH (and benzyl alcohol) under anaerobic 
conditions. a, Conditions: 1 mM (bpy)Cu(OTf£), 1mM TEMPO, 
10mVs-/, glassy carbon working electrode, 0.1 M BusNClO4/CH3CN. 

b, Conditions: 1 mM Cu(II)(OTf)2, 1mM bpy, 0.1 mM TEMPO, 50mM 
2,6-lutidine, 100 mM PhCH20OH, 2 ml, 25°C, 0.1 M BugNClO4/CH3CN. 

20 min reaction time. The final spectrum corresponds to an independently 
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TEMPO. ¢, d, Simplified catalytic cycles for oxoammonium- and 
(bpy)Cu/TEMPO-based alcohol oxidation pathways. e, The oxidized 
active site of galactose oxidase. 


(see Fig. 3b and c; the intercept in Fig. 3c corresponds to the anodic 
current of the (bpy)Cu(I1)/(bpy)Cu(I) redox couple in the absence 
of alcohol). 

These results may be analysed within the kinetic framework 
formulated by ref. 27 (see Supplementary Information for more 
details). The S-shaped voltammograms indicate that the electroca- 
talysis is not affected by the diffusion of base, substrate or TEMPO 
to the electrode and corresponds to the ‘kinetic regime’ in which the 
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prepared solution of 1 mM (bpy)Cu(I)OTE. The lack of isosbestic 
behaviour at early time points reflects changes in the Cu(II)/Cu(I)/bpy 
speciation under the reaction conditions. We note that the difference in 
peak currents for (bpy)Cu and TEMPO species in a reflects differences in 
the diffusion properties of these species. See Supplementary Information 
section 2 for further discussion. B: is the Bronsted base 2,6-lutidine. au, 
absorbance units; Ej/2, half-wave potential. 
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Figure 3 | Electrochemical oxidation of benzyl alcohol with 
(bpy)Cu/TEMPO and TEMPO catalysts. Cyclic voltammograms, kinetic 
data and performance comparison. a, Cyclic voltammograms recorded 
with solutions of (bpy)Cu/TEMPO, corresponding to 1 mM Cu(I)OTE, 
1mM bpy, 5mM TEMPO (black), upon addition of 50 mM Et3N (red), 
and then addition of 100 mM PhCH,OH (blue). b, Plot of i-a versus [(bpy) 
Cu(I)OTE£]. Conditions: 5mM TEMPO, 50 mM Et3N, 100 mM PhCH,OH. 
¢, Plot of ica, versus [PhCH,OH]°°. Conditions: 1 mM (bpy)Cu(I), 5mM 
TEMPO, 50 mM Et3N. d, Cyclic voltammograms recorded with solutions 


catalytic current is limited only by chemical steps involving the catalyst. 
The current measured for the catalytic reaction, ica, is described by 


leat = nFAC 4 Deat2Kovs (1) 


where k ops =kC? and 


substrate 


ict 1 [RT 2kobs 
ip 0.4463 Fnv 


Equation (1) is derived for two-electron oxidation of a substrate, where 
nis the number of electrons for the (bpy)Cu redox couple (n = 1), F is 
Faraday’s constant, A is the surface area of the electrode, C a is the 
electrocatalyst concentration, D a is the diffusion constant of the 
catalyst, 2 is the stoichiometric factor and ko, is the observed rate con- 
stant. This expression accounts for the first-order dependence of the 
catalytic current on the (bpy)Cu concentration (C oo and the half-order 
dependence on substrate concentration (C hee) (compare Fig. 3b 
and c; the half-order dependence of the catalytic current on Co ictrate 
corresponds to a first-order dependence of the chemical reaction rate 
(Kops) on C eae see Fig. 3g). The ratio icat/ip in equation (2), where 
ip is the peak current of the electrocatalyst in the absence of substrate, 


(2) 
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substrate 


dependence in the k,,,, expression in equation (1). 


of TEMPO, corresponding to 1mM TEMPO (black), 

upon addition of 450 mM NMI (red), and then addition of 100 mM 
PhCH,OH (blue). e, Plot of ica versus [TEMPO]. Conditions: 450 mM 
NMI, 100 mM PhCH;OH. f, Plot of ica: versus [PhCH OH]°°. Conditions: 
1mM TEMPO, 450mM NML. g, Performance comparison between 
(bpy)Cu/TEMPO- and TEMPO-based electrocatalytic oxidation of benzyl 
alcohol. All experiments in 0.1 M BugNClO4/CH3CN, at 150 mV s_! for 
(bpy)Cu/TEMPO and at 100 mV s~! for TEMPO- only reaction. 


provides a convenient means to determine kop; (where R is the ideal gas 
constant, T is the reaction temperature, and v is the scan rate). The 
catalytic wave for (bpy)Cu/TEMPO-mediated oxidation of benzyl alco- 
hol does not exhibit a clearly defined plateau, so the catalytic current 
was recorded at the peak potential of the (bpy)Cu complex. This 
approach will underestimate k,,,, but it avoids the uncertainties of iden- 
tifying a plateau. A kop; value of 11.6s~! at an applied potential of 
—0.14V was determined for the Cu/TEMPO/Et3N catalyst system 
using this method (Fig. 3g). 

Similar studies of electrocatalytic benzyl alcohol oxidation with a 
TEMPO-only catalyst were performed to provide a benchmark for 
comparison. The cyclic voltammogram of TEMPO shows a reversible 
one-electron redox feature corresponding to the TEMPO/TEMPO* 
couple at Ey; =0.24V (versus Cp2Fet’”) (Fig. 3d). Et3N cannot be 
used as a base with TEMPO because Et3N undergoes oxidation at 
the electrode surface in this potential range, and it also undergoes 
a background reaction with TEMPO. NMI has a basicity similar 
to EtsN (pK, values for NMI-H* and Ets;N-H* are 17.1 and 18.8, 
respectively, in CH3CN), and it was identified as an effective Bronsted 
base. TEMPO exhibited an S-shape catalytic wave at the TEMPO/ 
TEMPO* couple in the presence of NMI and benzyl alcohol (Fig. 3d). 
The catalytic current exhibited a saturation dependence on the NMI 
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Figure 4 | Electronic effects and comparative reactivity of different 
catalyst systems with different alcohols. a, b, Hammett plots of 
independent rate measurements of para-substituted benzyl alcohols with 
(bpy)Cu/TEMPO (a) and TEMPO (b) electrocatalysts. See Fig. 3 for 
conditions. c, The proposed mechanism of (bpy)Cu/TEMPO-mediated 
alcohol oxidation. d, Relative catalytic activity of three different catalyst 
systems [Cu/TEMPO, Cu/ABNO, and TEMPO-only] with six different 


concentration, and the NMI concentration was chosen to be within the 
concentration-independent-rate regime (450 mM; see Supplementary 
Fig. 9). Under these conditions, the catalytic current exhibited a linear, 
first-order dependence on [TEMPO] and a half-order dependence 
on benzyl alcohol concentration (Fig. 3e and f; see equation (2)). The 
kinetic constants calculated according to equation (2) revealed a kop, 
value of 2.3s~! at an applied potential of 0.36 V for the TEMPO/NMI 
catalyst system (Fig. 3g). 

Additional insights into the (bpy)Cu/TEMPO and TEMPO-only 
catalyst systems were obtained from kinetic isotope effects and 
Hammett studies. Independent rate measurements for the oxidation 
of PhCH2OH and PhCD,OH revealed a kinetic isotope effect of 1.0 
for (bpy)Cu/TEMPO, while a kinetic isotope effect of 3.2 was observed 
for the TEMPO-only catalyst system. The oxidation rates of six para- 
substituted benzyl alcohols were then measured with both catalyst sys- 
tems. Hammett plots of the data (Fig. 4a and b) reveal relatively weak, 
but opposite, electronic trends: electron-deficient alcohols are oxidized 
more readily with the (bpy)Cu/TEMPO catalyst system, with a slope 
(p) of +0.37, while electron-rich alcohols react more rapidly with the 
TEMPO-only catalyst (9 = —0.24). Previous studies of these catalyst 
systems with chemical oxidants supported the formation of (bpy) 
Cu(II)/alkoxide*?4 and TEMPOt/alkoxide!®’ adducts, respectively, 
which afford the aldehyde or ketone product via intramolecular hydro- 
gen transfer to the TEMPO fragment. With the Cu/TEMPO catalyst, 
the kinetic studies, isotope effects, and Hammett data in Figs 3 and 4a 
may be rationalized by rate-limiting formation of a Cu(II)-alkoxide, 


MeOH 
EtOH 


benzylic, aliphatic, primary and secondary alcohols. Conditions: 100 mM 
alcohol concentration and 0.1 M BuyNClO, in CH3CN, in all cases. 
Cu/TEMPO: 1 mM (bpy)Cu(I)(OT£), 5mM TEMPO, 50 mM Et;N; 
Cu/ABNO: 1 mM (bpy)Cu(I)(OTF£), 1 mM ABNO, 50 mM Et;N. TEMPO: 
1mM TEMPO, 450mM NMI. Rate constants were calculated according to 
equations (1) and (2) (see Supplementary Information for details). 


which will proceed more rapidly with electron-deficient (that is, more 
acidic) alcohols (Fig. 4c). With the TEMPO-only catalyst, the data 
in Figs 3 and 4b are consistent with rate-limiting hydrogen transfer 
from the alcohol to TEMPO?* within the TEMPO*/alkoxide adduct, 
wherein the hydrogen has a slightly hydridic character. 

The modular composition of the (bpy)Cu/TEMPO electrocatalyst 
makes it well suited for further optimization and development. In a 
preliminary assessment of this opportunity, TEMPO was replaced 
with the sterically less-hindered bicyclic nitroxyl, ABNO (9-azabicyclo 
[3.3.1]nonane N-oxyl)””. Six sterically and electronically different alco- 
hols were tested with the three catalyst systems, (bpy)Cu/TEMPO, 
(bpy)Cu/ABNO and TEMPO-only (Fig. 4d). The (bpy)Cu/nitroxyl 
catalysts exhibited substantially higher rates than the TEMPO-only 
catalyst for all substrates tested, including excellent reactivity with 
methanol and ethanol. The high activity of both (bpy)Cu/nitroxyl 
catalysts was especially notable because it occurred at an applied 
potential 0.5 V lower than for the TEMPO-only catalyst (—0.14V 
versus +0.36 V, respectively). The (bpy)Cu/ABNO catalyst exhibited 
higher activity than (bpy)Cu/TEMPO for nearly all alcohols, presum- 
ably reflecting the role of steric effects on the reaction”°. Methanol, 
the smallest available alcohol, was an exception and exhibited nearly 
identical rates with the two catalysts. 

The results presented above show that electrochemical alcohol oxi- 
dation with (bpy)Cu/nitroxyl catalysts occurs at the (bpy)Cu(II)/(bpy) 
Cu(I) redox potential. The nitroxyl is also required for catalytic activ- 
ity, however, and the data are consistent with cooperative reactivity 
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involving the (bpy)Cu(ID and nitroxyl species in a concerted one-pro- 
ton/two-electron oxidation of the Cu(II)-coordinated alkoxide ligand. 
This step affords the aldehyde or ketone product and is reminiscent of 
‘8-hydride elimination’ steps commonly observed with noble metals”. 
However, rather than direct hydride transfer to a single metal centre, 
the proton and two electron equivalents associated with the hydride 
are partitioned between Cu and nitroxyl co-catalysts, affording (bpy) 
Cu(I) and hydroxylamine (for example, TEMPOH) (compare 
Fig. 1b and d). Thus, Cu(II) serves as a one-electron oxidant, while the 
nitroxyl serves as an electron-proton acceptor. 

These observations have important implications for the devel- 
opment of electrocatalysts composed of non-precious metals, with 
applications extending beyond alcohol oxidation. The role of the 
nitroxyl as an electron-proton-transfer mediator (in which both 
an electron and a proton are transferred) in the (bpy)Cu/nitroxyl- 
catalysed reactions may be compared to the role of Bronsted bases in 
many other electrocatalytic reactions, including the tethered amines of 
the hydrogenase mimics in Fig. la. Bronsted bases play a crucial role in 
promoting proton-coupled electron transfer steps, in which deproto- 
nation of a catalytic intermediate is coupled to one-electron transfer to 
the catalytic metal centre or to the electrode. Such concerted steps can 
greatly enhance catalytic rates by avoiding energetically unfavourable 
intermediates that would arise from independent proton- and elec- 
tron-transfer steps”, Electron-proton-transfer mediators, such as 
nitroxyls, expand upon this concept in an important way because such 
entities are equipped to accept both a proton and an electron. By work- 
ing in cooperation with a first-row transition metal, electron-proton- 
transfer mediators provide the basis for efficient proton-coupled 
two-electron reactivity. The present study illustrates this concept, 
and shows how it can lead to much faster electrocatalytic rates and 
lower overpotentials, both of which are crucial in energy-conversion 
applications. Two-electron redox steps are intrinsic to electrochemical 
reactions involving H2, O2, N2, CO2 and numerous other molecules, 
and the development of new cooperative electrocatalysts composed 
of non-precious metals and electron-proton-transfer mediators could 
lead to important breakthroughs in these applications. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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Absence of 21st century warming on Antarctic 
Peninsula consistent with natural variability 


John Turner!, Hua Lu', Ian White!, John C. King!, Tony Phillips', J. Scott Hosking!, Thomas J. Bracegirdle!, Gareth J. Marshall', 


Robert Mulvaney! & Pranab Deb! 


Since the 1950s, research stations on the Antarctic Peninsula 
have recorded some of the largest increases in near-surface air 
temperature in the Southern Hemisphere!. This warming has 
contributed to the regional retreat of glaciers”, disintegration 
of floating ice shelves? and a ‘greening’ through the expansion in 
range of various flora*. Several interlinked processes have been 
suggested as contributing to the warming, including stratospheric 
ozone depletion’, local sea-ice loss®, an increase in westerly winds*’, 
and changes in the strength and location of low-high-latitude 
atmospheric teleconnections®”. Here we use a stacked temperature 
record to show an absence of regional warming since the late 1990s. 
The annual mean temperature has decreased at a statistically 
significant rate, with the most rapid cooling during the Austral 
summer. Temperatures have decreased as a consequence of a greater 
frequency of cold, east-to-southeasterly winds, resulting from more 
cyclonic conditions in the northern Weddell Sea associated with a 
strengthening mid-latitude jet. These circulation changes have 
also increased the advection of sea ice towards the east coast of the 
peninsula, amplifying their effects. Our findings cover only 1% of 
the Antarctic continent and emphasize that decadal temperature 
changes in this region are not primarily associated with the drivers 
of global temperature change but, rather, reflect the extreme natural 
internal variability of the regional atmospheric circulation. 

While global mean surface air temperature (SAT) has increased 
over recent decades, the rate of regional warming has varied 
markedly’, with some of the most rapid SAT increases recorded in 
the polar regions'!"'. In Antarctica, the largest SAT increases have 
been observed in the Antarctic Peninsula (AP) and especially on its 
west coast!: in particular, Vernadsky (formerly Faraday) station (Fig. 1) 
experienced an increase in annual mean SAT of 2.8°C between 1951 
and 2000. 

The AP is a challenging area for the attribution of the causes of 
climate change because of the shortness of the in situ records, the 
large inter-annual circulation variability“ and the sensitivity to local 
interactions between the atmosphere, ocean and ice. In addition, the 
atmospheric circulation of the AP and South Pacific are quite different 
between summer (December-February) and the remainder of the year. 

Since the late 1970s, the springtime loss of stratospheric ozone has 
contributed to the warming of the AP, particularly during summer’. 
However, during the extended winter period of March-September, 
when teleconnections between the tropics and high southern latitudes 
are strongest'®, tropical sea surface temperature (SST) anomalies in 
the Pacific and Atlantic Oceans’* can strongly modulate the climate 
of the AP. The teleconnections are further affected by the mid-latitude 
jet, which influences regional cyclonic activity and AP SATs. While the 
jet is strong for most of the year, during the summer it is weaker, there 
are fewer cyclones, and tropical forcing plays little part in AP climate 
variability. 

The annual mean SAT records from six coastal stations located in 
the northern AP (Fig. 1) show a warming through the second half of 


the twentieth century, followed by little change or a decrease during 
the first part of the twenty-first century!’. We investigate the differ- 
ences in high and low latitude forcing on the climate of the AP during 
what we henceforth term the ‘warming’ and ‘cooling’ periods, focus- 
ing particularly on the period since 1979, since this marks the start of 
the availability of reliable, gridded atmospheric analyses and fields of 
sea-ice concentration (SIC). We use a stacked and normalized SAT 
anomaly record (Fig. 2a) based on the six station SAT time series (see 
Methods) to investigate the broad-scale changes that have affected the 
northern AP since 1979. To provide an objective measure of the tim- 
ing of the change in trend we used the sequential Mann-Kendall test 
(see Methods). This identified the middle of 1998 to early 1999 as the 
most likely turning point between the warming and cooling periods 
(indicated by shading on Fig. 2). The trends in the stacked SAT during 
the warming (0.32 + 0.20 per decade (dec~!), 1979-1997) and cooling 
(—0.47 £0.25 dec™!, 1999-2014) periods are both statistically signifi- 
cant at P< 0.05 (Extended Data Table 1). To confirm that the change in 
trend is not simply an artefact of the extreme El Nifo conditions during 
1997-1998, we repeated the analysis for 1979-1996 and 2000-2014. 
The trends were still significant at P < 0.05, although magnitudes were 
slightly smaller. 

While the stacked SAT increased in all seasons during the warm- 
ing period of 1979-1997 (Extended Data Table 1), the warming was 
largest during the summer, although the significance of the trend is 
lower (P< 0.10). During this period there was a positive trend in the 
Southern Annular Mode (SAM)°, primarily during summer (Extended 
Data Fig. 1) in response to stratospheric ozone depletion and increas- 
ing greenhouse gas concentrations”'®. The trend in the SAM led toa 
greater flow of mild, northwesterly air onto the AP (Extended Data 
Fig. 2a), with SAT on the northeastern side increasing most because of 
amplification through the foehn effect’. This atmospheric circulation 
trend contributed to the large decrease in SIC in summer (Extended 
Data Fig. 3a) and for the year as a whole (Fig. 3a). However, there was 
no significant trend in annual mean sea-level pressure (SLP) across the 
AP during the warming period (Fig. 3b). During the summer, tropical 
climate variability had little influence on the AP SATs'® and the trend 
in the SAM had the greatest impact. 

Over the cooling period of 1999-2014 there was a significant increase 
in annual mean SIC around the northern AP and across the northern 
part of the Weddell Sea (Fig. 3c). This occurred as a result of increas- 
ingly cyclonic conditions in the Drake Passage and northwestern 
Weddell Sea (Fig. 3d), associated with a strengthening mid-latitude 
jet, which advected cold air towards the AP. 

The stacked SAT decreased in all seasons over 1999-2014, but with 
the greatest cooling during summer, when the trend was moderately 
significant (P< 0.10) (Extended Data Table 1). In this season the trends 
during the warming and cooling periods are different at a 90% confi- 
dence level (see Methods). 

During the summer the SAM index remained predominantly pos- 
itive (Extended Data Fig. 1), and SLP was on average lower over the 
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Figure 1 | SAT changes at the six AP stations. 
a, b, Map of the Antarctic (a) with a blow up of 
the AP showing the locations of stations referred 
to in the text (b). The locations of the drilling 
sites for the Ferrigno and James Ross Island ice 
cores are indicated by a red diamond and an 


arrowed cross, respectively. c—h, The time series 
of annual mean SAT anomalies are shown for 
Bellingshausen (c), O'Higgins (d), Esperanza 

(e), Marambio (f), Vernadsky (g) and Rothera 
(h), with each horizontal line indicating the 

mean for the whole time series. AS, Amundsen 
Sea; BS, Bellingshausen Sea; DP, Drake Passage; 
WAIS, West Antarctic Ice Sheet; WS, Weddell Sea. 
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Antarctic than during the warming period (Fig. 4a). However, there 
was no significant trend in the SAM, probably owing to there being 
little change in the depth of the ozone hole. 

The summer cooling resulted from strong east-to-southeasterly 
near-surface flow towards the AP as SLP decreased (increased) over 
the South Atlantic (Bellingshausen Sea) (Extended Data Fig. 4a). The 
greater cyclonic conditions over the South Atlantic occurred in associ- 
ation with a mid-latitude jet that was significantly (P < 0.05) stronger 
than during the warming period (Fig. 4b), and which was located at the 
northern limit of a cold trough extending from Antarctica. The stronger 
east-to-southeasterly flow advected sea ice towards the east coast of the 
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AP, giving a positive SIC trend that extended across the whole of the 
northern Weddell Sea (Extended Data Fig. 5a). This greater ice extent 
limited the flux of heat from the ocean and amplified the effects of the 
circulation changes. 

During the extended winter, the circulation over the South Pacific is 
marked by a clearly defined climatological split jet structure, with the 
subtropical jet (STJ) near 30° S and the polar front jet (PFJ) close to 
60° S (ref. 19). The PE] is sensitive to both zonally symmetric forcing, 
such as SAM variability, and regional factors, such as the stationary 
Rossby waves propagating from the tropical Pacific Ocean’, and the 
meridional gradient of extratropical SSTs. In general, the ST] (PFJ) is 


Figure 2 | AP temperature and measures of 
tropical climate variability since 1979. a, The 
stacked-normalized SAT anomalies for 1979- 
2014 (thin black line), with the thick black line 
showing the annual mean values. The solid red 
lines show the linear trends for the warming and 
cooling periods, with the 95% confidence limits 
for the trends indicated by the broken lines. b, The 
monthly mean IPO index (continuous black line) 
and Nifio 3.4 temperature anomaly (broken line), 
with the grey line showing the IPO index with 
decadal smoothing. The vertical grey-shaded area 
on both figures indicates the period of transition 
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Figure 3 | Trends and differences in 


8 8 atmospheric and oceanic conditions. 
=| = BF = a, The trend in annual mean SIC for 1979-1997. 
3\ st b, The trend in annual mean SLP for 1979-1997. 
c, The trend in annual mean SIC for 1999-2014. 
d, The trend in annual mean SLP for 1999-2014. 
8 ‘JS e, The difference in annual mean SSTs between 
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stronger during El Nifo/positive Interdecadal Pacific Oscillation (IPO) 
(La Nifta/negative IPO)”°”!. Nevertheless, their combined effect on 
the AP regional circulation is complex and may involve nonlinear 
wave-mean flow feedbacks”. 

The tropical-high-latitude linkages during the extended winter were 
examined through an analysis of large-scale Rossby wave propagation 
via the horizontal stationary Eliassen—Palm fluxes”? at 300 hPa (see 
Methods). During the 1979-1997 warming period, tropical SSTs were 
characterized by relatively high SSTs across the eastern tropical Pacific 
Ocean with relatively more frequent El Nifio conditions and a more 
positive IPO (Fig. 2b), and thus a relatively weak (strong) PFJ (STJ). 


The generation of quasi-stationary Rossby waves occurred primarily 
close to 180° E, consistent with higher SSTs in this area, and the wave 
propagation from the tropics to the South Pacific was limited by a 
strengthened ST]. 

During the 1999-2014 cooling period, a major change in tropical 
Pacific SSTs occurred with SSTs higher over the Maritime Continent 
and lower over the eastern Pacific Ocean (Fig. 3e). There was also 
enhanced transmission of quasi-stationary Rossby waves from the 
tropics towards the Antarctic, which can be seen in the differences 
in 300hPa stream function (Fig. 4f). However, at higher latitudes 
wave propagation was reduced or prohibited because of enhanced 
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Figure 4 | Differences in atmospheric conditions between the cooling 
and warming periods (1999-2014 minus 1979-1997). a-f, SLP 
December-February (DJF) (a), 300 hPa zonal wind component DJF (b), 
stream function (colours) and wave propagation (arrows) DJF (c), SLP 


equatorward wave refraction or reflection from the PFJ region (Fig. 4f). 
The PFJ was significantly stronger across the South Pacific during the 
cooling period (Fig. 4e), which is consistent with the higher frequency 
of La Nifia-like conditions and an enhanced meridional temperature 
gradient between mid- and high latitudes, corresponding to a negative 
IPO (Fig. 2b). The stronger PFJ resulted in greater surface cyclonic 
activity to the west of the AP during the cooling period (Fig. 4d), which 
typically gives higher SATs across the AP”°. However, critically over 
1999-2014 there was a climatological trough in the Drake Passage, 
giving enhanced easterly flow across the northern Weddell Sea towards 
the AP. The progressively stronger easterly flow is shown by a decrease 
of SLP (significant at P< 0.05 in March-May (MAM) and June-August 
(JJA)) in the Drake Passage (Extended Data Fig. 4b-d) and a positive 
trend in SIC around the northern AP (Extended Data Fig. 5b-d). The 
cold, east-to-southeasterly circulation negated the warming effect from 
the tropics usually found in association with La Nifa-like conditions, 
resulting in a cooling trend in SAT during the extended winter period 
over 1999-2014. 

The start of the AP cooling in 1998 coincided with the so-called 
‘global warming hiatus. There has been extensive discussion over the 
factors responsible for this reduction in the rate of increase in mean 
global SAT”*”S, with the negative phase of the IPO, volcanic and solar 
activity, and aerosol forcing being cited as possible causes. As dis- 
cussed earlier, the phase of the IPO can affect the climate of the AP, 
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March-September (d), 300 hPa zonal wind component March-September 
(e), stream function (colours) and wave propagation (arrows) March- 
September (f). Areas where the differences are significant at P< 0.05 are 
indicated by a bold line. 


but the negative phase is usually associated with higher SATs during the 
extended winter. Since the late 1990s, AP SATs have decreased through- 
out the year, with local factors playing a greater part than tropical vari- 
ability, indicating that the absence of AP SAT warming is independent 
of the global warming hiatus. 

The recent change in SAT trend can be set in a longer-term perspec- 
tive through examination of regional ice core records. An ice core from 
James Ross Island”° (Fig. 1), which is close to Marambio station, showed 
that the region experienced several periods of rapid warming and cool- 
ing in the last 1,000 years, and that the warming trend over the last 100 
years was ‘highly unusual, although not unprecedented. However, the 
period since the late 1970s includes the ozone hole, which is unique 
in the record. The Ferrigno ice core from the coast of West Antarctica 
(Fig. 1) shows a warming from the 1950s to the early twenty-first cen- 
tury that agrees well with the warming observed at Vernadsky”’. In 
the longer term, this record revealed marked decadal variability and, 
importantly, resolved a 50-year period in the eighteenth century when 
SATs increased at a faster rate than observed at Vernadsky over the 
second half of the twentieth century. Such long-term variability is also 
expected from analysis of station SAT records, which exhibit statistical 
long-term persistence”®. Therefore all these studies suggest that the 
rapid warming on the AP since the 1950s and subsequent cooling since 
the late-1990s are both within the bounds of the large natural decadal- 
scale climate variability of the region. This result is also consistent with 
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the very high level of decadal-scale natural internal variability of the 
regional atmospheric circulation seen in long control runs of climate 
models”’. Climate model projections forced with medium emission 
scenarios indicate the emergence of a large anthropogenic regional 
warming signal, comparable in magnitude to the late twentieth cen- 
tury Peninsula warming, during the latter part of the current century”. 
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Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Data. The monthly mean SAT data for the stations were obtained primarily from 
the Scientific Committee on Antarctic Research (SCAR) Reference Antarctic Data 
for Environmental Research (READER) data base*!. These values are derived as 
the mean of the synoptic (6-hourly) observations. A monthly value was produced 
only if 90% of the 6-hourly data were available. To maximize the number of these 
data, if a value was missing, but adjacent 3-hourly values were available, then an 
estimate of the 6-hourly value was made by linearly interpolating between these 
two adjacent values; for example, if a value at 06Z was missing but values existed 
at 03Z and 09Z then the mean of those values would be used to estimate a value 
for 06Z. Any resultant bias will be random and should have a negligible effect on 
the monthly mean. Similar to READER, if there were insufficient synoptic obser- 
vations to derive a monthly mean, a value was taken from the monthly CLIMAT 
message if available. 

To account for differences in temperature variability between the six stations, 
we have generated a monthly mean stacked SAT anomaly record. These values 
are the mean of the available normalized temperature anomalies, of which at 
least five are available from 1979 onwards. Anomalies are calculated relative 
to the 1981-2010 period. Our use of the stacked temperature anomalies is 
justified by the fact that SATs at all six stations have a similar relationship to 
mean SLP (Extended Data Fig. 6) and therefore the broad-scale atmospheric 
circulation, despite there being differences in local effects. Indeed, Ding and 
Steig*? also demonstrated that trends computed from a stacked AP SAT record 
showed good, broad-scale agreement with trends determined from satellite and 
reanalysis data. 

To investigate atmospheric circulation variability for 1979-2014, we use 

the fields from the European Centre for Medium-range Weather Forecasting 
(ECMWEF) Interim reanalysis (ERA-Interim), which have a grid spacing of 
0.7° x 0.7° and are considered to be the best reanalysis for depicting recent 
Antarctic climate**. Fields of mean SIC, computed using the Bootstrap version 2 
algorithm* were obtained on a 25-km-resolution grid from the US National Snow 
and Ice Data Center (http://www.nsidc.org). 
Statistical methods. A non-parametric statistical technique called the sequen- 
tial Mann-Kendall test*°-*” was employed to detect the period when a significant 
change of trend occurred in the monthly mean stacked temperature anomalies. 
Note that the sequential Mann-Kendiall test has been widely used to detect approxi- 
mate potential trend turning points in time series****, 

The null hypothesis we tested was that there is no turning point in the trend 
of the monthly temperature anomaly time series under investigation. To prove 
or to disprove the null hypothesis, our calculation follows the procedure given 
by Gerstengarbe and Werner*®, Let a temperature time series X = {Xj,..., Xn} be 
separated into n — 1 subseries (that is, the first subseries includes the sample values 
Xj, X,..., the second include the values X>, X3,..., and so on). The n — 1 Mann- 
Kendall test statistic variables*® determined by these subseries are given as: 


where R; is the rank of the t-th sub-series {X1, X2,..., X;41}, that is, the num- 
ber of the elements X;{i > j} such that X;> Xj with i=2,..., tandj=1,...,i—1. 
Consequently, for each of the n — 1 subseries, the corresponding progressive row 
U(t) is defined as 


_W-E(M) 


me) Var(W,) 


where E(W,) is the expected value of the respective subseries with 


InCh = 1) 
E(W,) = ———— 
(m) == 
and Var(W,) is the respective variance given by 
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which assumes that, for /; + 00 (J;= the length of the subseries), W, is approxi- 
mately Gaussian and the normalized U(t) is assumed to be a standard Gaussian 
distribution. Similarly, for a given t, the corresponding rank series V(t), 
the so-called retrograde row can be calculated using the reversed time series 
{Xi41, Xp—2.- Xi}. 


Once the progressive and retrograde rank series U(t) and V(t) are calculated 
for all t=1,...,n—1, the temporal locations where U(t) and V(#) cross each other 
are determined and considered as potential trend turning points. When either 
the progressive U(t) or retrograde V(t) row exceeds certain confidence limits (for 
example, the absolute values of U(t) and V(t) become greater than 2 in our case) 
before and after the crossing point, the null hypothesis that the sampled time series 
has no change points is rejected. 

The statistical significance of the linear trends during the warming and cooling 
periods is estimated using the non-parametric Mann-Kendall Tau and Sens slope 
test*®°8, where the P values are calculated as two-sided. To test the significance of 
the trend differences between two periods, we compute the confidence intervals 
at 90% and 95% levels for individual trends and examine whether or not they 
overlap”. 

As the sequential Mann-Kendall test can only detect the approximate time 
when a change of the trend may occur, we find that it is often the case that multi- 
ple changes can be detected over a couple of years. Namely, the trend change may 
actually take a couple of years to occur in the real world. 

Correlations, linear trends and composite differences from gridded data sets 

were computed using monthly anomalies based on a standard least-squares 
method, with the methodology used to calculate the significance levels based upon 
Santer et al.*°. An effective sample size was calculated based on the lag 1 auto- 
correlation coefficient of the regression residuals. This effective sample size was 
used for the computation of the standard error and in indexing the critical values 
of Student's t distribution. On figures showing the differences of means between 
the warming and cooling periods, significance levels were calculated using t-tests 
performed without the assumption of equal variances between the periods being 
compared. For the computation of the correlations, the time series of SST and 
atmospheric fields were all detrended. 
Determination of the stationary wave fluxes. The formulation of Plumb” was 
used to estimate the difference in stationary wave activity between the warming 
and cooling periods in the upper troposphere (for example, 300 hPa). The fluxes 
give an indication of the direction of wave propagation and take the form: 


pe Poose?| 2 g' on fut 2 Ou! 
2 af cosp OX af cosp OX 


where , vy are the zonal and meridional coordinates, u, v are the zonal and merid- 
ional velocity components, p represents normalized pressure, which is pressure 
divided by a standard reference pressure p,, here p, is taken as 1,000 hPa, a is the 
Earth’s radius, f= 2Qsiny is the Coriolis parameter with (2 the Earth’s rotation rate 
and @ the geopotential. Primes represent deviations from the zonal-mean. Note 
that F has been written in terms of the geopotential & by using the geostrophic 
wind relations y = ——", v= Ya 

a acosip 
This is to reduce the possible amplification of noise due to successive derivatives 
and to facilitate the physical understanding of each components. All differentiation 
is performed using centred differences. Also, F is calculated on a monthly basis to 
isolate the stationary perturbations and then a time-average is taken over the 
selected period for each year. 


, where w is the geostrophic stream function. 
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Extended Data Figure 1 | The Southern Annular Mode. The austral summer (December-February) SAM index“? for December 1979-February 2014. 
The linear trends for 1980-1997 and 1999-2014 are shown in red. The data were obtained from https://legacy.bas.ac.uk/met/gjma/sam.html. 
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Extended Data Figure 2 | Seasonal SLP trends during the warming period. a-d, DJF December 1979-February 1998 (a), MAM 1979-1997 (b), 
JJA 1979-1997 (c) and September-November (SON) 1979-1997 (d). Areas where the trends are significant at P < 0.05 are indicated by a bold line. 
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Extended Data Figure 3 | Seasonal trends in sea-ice concentration during the warming period. a~d, DJF December 1979-February 1998 (a), 
MAM 1979-1997 (b), JJA 1979- 1997 (c) and SON 1979-1997 (d). Areas where the trends are significant at P < 0.05 are indicated by a bold line. 
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Extended Data Figure 4 | Seasonal SLP trends during the cooling period. a~d, DJF December 1999-February 2014 (a), MAM 1999-2014 (b), 
JJA 1999-2014 (c) and SON 1999-2014 (d). Areas where the trends are significant at P < 0.05 are indicated by a bold line. 
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Extended Data Figure 5 | Seasonal trends in sea-ice concentration during the cooling period. a~d, DJF December 1999-February 2014 (a), 
MAM 1999-2014 (b), JJA 1999-2014 (c) and SON 1999-2014 (d). Areas where the trends are significant at P< 0.05 are indicated by a bold line. 
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Extended Data Figure 6 | The correlation of annual mean SAT from the stations with annual mean SLP for 1979-2014. a-f, Areas where the 
correlation is significant at P< 0.05 are indicated by a bold line. Rothera (a), Vernadsky (b), Bellingshausen (c), O'Higgins (d), Esperanza (e) and 
Marambio (f). 
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Extended Data Table 1 | Annual and seasonal trends of the stacked, normalized temperature record 


Annual DJF MAM JJA 
1979 — 1997 0.32** 0.51* 0.18 0.04 
1999 - 2014 -0.47** -0.72* -0.39 -0.09 
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SON 


0.33 


-0.36 


Trends are given for the warming (1979-1997) and cooling (1999-2014) periods and are per decade. Significance of the trends is indicated as **P<0.05 and *P<0.10. Note, the DJF trends are for 


the summer seasons over December 1979-February 1998 and December 1999-February 2014. 
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Bright spots among the world’s coral reefs 
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Ongoing declines in the structure and function of the world’s coral 
reefs!” require novel approaches to sustain these ecosystems and the 
millions of people who depend on them’. A presently unexplored 
approach that draws on theory and practice in human health and 
rural development*” is to systematically identify and learn from 
the ‘outliers’—places where ecosystems are substantially better 
(‘bright spots’) or worse (‘dark spots’) than expected, given the 
environmental conditions and socioeconomic drivers they are 
exposed to. Here we compile data from more than 2,500 reefs 
worldwide and develop a Bayesian hierarchical model to generate 
expectations of how standing stocks of reef fish biomass are related 
to 18 socioeconomic drivers and environmental conditions. We 
identify 15 bright spots and 35 dark spots among our global survey 
of coral reefs, defined as sites that have biomass levels more than 
two standard deviations from expectations. Importantly, bright 
spots are not simply comprised of remote areas with low fishing 
pressure; they include localities where human populations and use 
of ecosystem resources is high, potentially providing insights into 
how communities have successfully confronted strong drivers of 
change. Conversely, dark spots are not necessarily the sites with the 
lowest absolute biomass and even include some remote, uninhabited 
locations often considered near pristine®. We surveyed local 
experts about social, institutional, and environmental conditions 
at these sites to reveal that bright spots are characterized by strong 
sociocultural institutions such as customary taboos and marine 
tenure, high levels of local engagement in management, high 
dependence on marine resources, and beneficial environmental 
conditions such as deep-water refuges. Alternatively, dark spots 
are characterized by intensive capture and storage technology and 
a recent history of environmental shocks. Our results suggest that 
investments in strengthening fisheries governance, particularly 
aspects such as participation and property rights, could facilitate 


innovative conservation actions that help communities defy 
expectations of global reef degradation. 

Despite substantial international conservation efforts, diversity and 
abundance continue to decline within many of the world’s ecosystems” 
Most conservation approaches aim to identify and protect places of 
high ecological integrity under minimal threat®. Yet, with escalating 
social and environmental drivers of change, conservation actions are 
also needed where people and nature coexist, especially where human 
effects are already severe’. Here, we highlight an approach for imple- 
menting conservation in coupled human-natural systems focused on 
identifying and learning from outliers—places that are performing 
substantially better than expected, given the socioeconomic and envi- 
ronmental conditions they are exposed to. By their very nature, outliers 
deviate from expectations, and consequently can provide novel insights 
into confronting complex problems where conventional solutions have 
failed. This type of positive deviance, or bright spot analysis has been 
used in fields such as business, health, and human development to 
uncover local actions and governance systems that work in the con- 
text of widespread failure!®!, and holds much promise in informing 
conservation. 

To demonstrate this approach, we compiled data from 2,514 coral 
reefs in 46 countries, states, and territories (hereafter ‘nations/states’) 
and developed a Bayesian hierarchical model to generate expected con- 
ditions of how standing reef fish biomass (a key indicator of resource 
availability and ecosystem functions’) was related to 18 key environ- 
mental variables and socioeconomic drivers (Fig. 1; Extended Data 
Tables 1-4; Extended Data Figs 1-3; Methods). Drawing on a broad 
body of theoretical and empirical research in the social sciences!*"'° 
and ecology”®'® on coupled human-natural systems, we quantified 
how reef fish biomass (Fig. 1a) was related to distal social drivers such 
as markets, affluence, governance, and population (Fig. 1b, c), while 
controlling for well-known environmental conditions such as depth, 
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Figure 1 | Global patterns and drivers of reef fish biomass. a, Reef fish 
biomass among 918 study sites. Points vary in size and colour proportional 
to the amount of fish biomass. b-d, Standardized effect size of local- 

scale social drivers, nation/state-scale social drivers, and environmental 
covariates, respectively. Parameter estimates are Bayesian posterior median 


habitat, and productivity (Fig. 1d) (Extended Data Table 1; Methods). 
In contrast to many global studies of reef systems that are focused on 
demonstrating the severity of human effects®, our examination seeks 


Standardized effect size 


values, 95% uncertainty intervals (UI; thin lines), and 50% UI (thick lines). 
Black dots indicate that the 95% UI does not overlap 0; grey closed circles 
indicates that 75% of the posterior distribution lies to one side of 0; and 
grey open circles indicate that the 50% UI overlaps 0. 


to uncover potential policy levers by highlighting the relative role of 
specific social drivers. A key finding from our global analysis is that 
our metric of potential interactions with urban centres, called market 
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Figure 2 | Bright and dark spots among the world’s coral reefs. a, Each 
site’s deviation from expected biomass (y axis) along a gradient of nation/ 
state mean biomass (x axis). The 50 sites with biomass values >2 standard 
deviations above or below expected values were considered bright (yellow) 
and dark (black) spots, respectively. Each grey vertical line represents a 


nation/state; those with bright or dark spots are labelled and numbered. 
There can be multiple bright or dark spots in each nation/state. b, Map 
highlighting bright and dark spots with large circles, and other sites in 
small circles. Numbers correspond to panel a. 
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Figure 3 | Differences in key social and environmental conditions between bright spots, dark spots, and ‘average’ sites. a, Social and institutional 
conditions; b, external- or donor-driven projects; c, technologies; d, environmental conditions. P values are determined using Fisher's exact test. 


Intensive netting includes beach seine nets, surround gill nets, and muro-ami. 


gravity!” (Methods), more so than local or national population 
pressure, management, environmental conditions, or national socioec- 
onomic context, had the strongest relationship with reef fish biomass 
(Fig. 1). Specifically, we found that reef fish biomass decreased as the 
size and accessibility of markets increased (Extended Data Fig. 1b). 
Somewhat counter-intuitively, fish biomass was higher in places with 
high local human population growth rates, probably reflecting human 
migration to areas of better environmental quality'’—a phenomenon 
that could result in increased degradation at these sites over time. We 
found a strong positive, but less certain relationship (that is, a high 
standardized effect size, but only >75% of the posterior distribution 
above zero) with the Human Development Index, meaning that reefs 
tended to be in better condition in wealthier nations/states (Fig. 1c). 
Our analysis also confirmed the role that marine reserves can play in 
sustaining biomass on coral reefs, but only when compliance is high 
(Fig. 1b), reinforcing the importance of fostering compliance for 
reserves to be successful. 

Next, we identified 15 bright spots and 35 dark spots among the 
world’s coral reefs, defined as sites with biomass levels more than two 
standard deviations higher or lower than expectations from our global 
model, respectively (Fig. 2; Methods; Extended Data Table 5). Rather 
than simply identifying places in the best or worst condition, our bright 
spots approach reveals the places that most strongly defy expectations. 
Using them to inform the conservation discourse will certainly chal- 
lenge established ideas of where and how conservation efforts should 
be focused. For example, remote places far from human impacts are 
conventionally considered near-pristine areas of high conservation 
value’, yet most of the bright spots we identified occur in fished, pop- 
ulated areas (Extended Data Table 5), some with biomass values below 
the global average. Alternatively, some remote places such as parts of 
the northwest Hawaiian Islands underperform (that is, were identified 
as dark spots). 

Detailed analysis of why bright spots can evade the fate of similar 
areas facing equivalent stresses will require a new research agenda 
gathering detailed site-level information on social and institutional 
conditions, technological innovations, external influences, and 
ecological processes!” that are simply not available in a global-scale 
analysis. As a hypothesis-generating exploration to begin uncovering 
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why bright and dark spots may diverge from expectations, we sur- 
veyed data providers who sampled the sites and other experts with 
first-hand knowledge about the presence or absence of ten key social 
and environmental conditions at the 15 bright spots, 35 dark spots, 
and 14 average sites with biomass values closest to model expecta- 
tions (see Methods and Supplementary Information for details). Our 
initial exploration revealed that bright spots were more likely to have 
high levels of local engagement in the management process, high 
dependence on coastal resources, and the presence of sociocultural 
governance institutions such as customary tenure or taboos (Fig. 3; 
Methods). For example, in one bright spot, Karkar Island, Papua New 
Guinea, resource use is restricted through an adaptive rotational har- 
vest system based on ecological feedbacks, marine tenure that allows 
for the exclusion of fishers from outside the local village, and initiation 
rights that limit individuals entry into certain fisheries”. Bright spots 
were also generally proximate to deep water, which may help provide 
a refuge from disturbance for corals and fish”! (Fig. 3; Extended Data 
Fig. 4). Conversely, dark spots were distinguished by having fishing 
technologies allowing for more intensive exploitation, such as fish 
freezers and potentially destructive netting, as well as a recent history 
of environmental shocks (for example, coral bleaching or cyclone; 
Fig. 3). The latter is particularly worrisome in the context of climate 
change, which is likely to lead to increased coral bleaching and more 
intense cyclones”. 

Our global analyses highlight two novel opportunities to inform 
coral reef governance. The first is to use bright spots as agents of 
change to expand the conservation discourse from the current focus 
on protecting places under minimal threat®, towards harnessing les- 
sons from places that have successfully confronted numerous or severe 
stressors. Our bright spots approach can be used to inform the types of 
investments and governance structures that may help to create more 
sustainable pathways for impacted coral reefs. Specifically, our initial 
investigation highlights how investments that strengthen fisheries 
governance, particularly issues such as participation and property 
rights, could help communities to innovate in ways that allow them 
to defy expectations. Conversely, the more typical efforts to provide 
capture and storage infrastructure, particularly where there are envi- 
ronmental shocks and local-scale governance is weak, may lead to 
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social-ecological traps”* that reinforce resource degradation beyond 
expectations. Effectively harnessing the potential to learn from both 
bright and dark spots will require scientists to increase research efforts 
in these places, NGOs to catalyse lessons from other areas, donors to 
start investing in novel solutions, and policy makers to ensure that 
governance structures foster flexible learning and experimentation. 
Indeed, bright spots may have much to offer in terms of how to crea- 
tively confront drivers of change and prioritize conservation actions. 
Likewise, dark spots can help identify development strategies to avoid. 
Critically, the bright spots we identified span the development spec- 
trum from low to high income (for example, Solomon Islands and 
territories of the USA, respectively; Fig. 2), showing that lessons about 
effective reef management can emerge from diverse places. 

A second opportunity stems from a renewed focus on managing the 
socioeconomic drivers that shape reef conditions. Many social drivers 
are amenable to governance interventions, and our comprehensive 
analysis (Fig. 1) suggests that an increased policy focus on social drivers 
such as markets and development could result in improvements to reef 
fish biomass. For example, given the important influence of markets in 
our analysis, reef managers, donor organizations, conservation groups, 
and coastal communities could improve sustainability by developing 
interventions that dampen the negative influence of markets on reef sys- 
tems. A portfolio of market interventions, including eco-labelling and 
sustainable harvesting certifications, fisheries improvement projects, 
and value chain interventions have been developed within large-scale 
industrial fisheries to condition access to markets based on sustainable 
harvesting’**”’. Although there is considerable scope for adapting these 
interventions to artisanal coral reef fisheries in both local and regional 
markets, effectively dampening the negative influence of markets may 
also require developing novel interventions that address the range of 
ways in which markets can lead to overexploitation. Existing research 
suggests that markets create incentives for overexploitation not only by 
affecting price and price variability for reef products”®, but also by influ- 
encing people’ behaviour?””®, including their willingness to cooperate 
in the collective management of natural resources”. 

The long-term viability of coral reefs will ultimately depend on inter- 
national action to reduce carbon emissions”. However, fisheries remain 
a pervasive source of reef degradation, and effective local-level fisheries 
governance is crucial to sustaining ecological processes that give reefs 
the best chance of coping with global environmental change*”. Seeking 
out and learning from bright spots is a novel approach to conserva- 
tion that may offer insights into confronting the complex governance 
problems facing coupled human-natural systems such as coral reefs. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. 
Scales of data. Our data were organized at three spatial scales: 


(i) Reef (n=2,514). The smallest scale, which had an average of 2.4 surveys/ 
transects. 

(ii) Site (a cluster of reefs; n=918). We clustered reefs together that were with- 
in 4km of each other, and used the centroid of these clusters to estimate 
site-level social and site-level environmental covariates (Extended Data 
Table 1). To make these clusters, we first estimated the linear distance 
between all reefs, then used a hierarchical analysis with the complete- 
linkage clustering technique based on the maximum distance between 
reefs. We set the cut-off at 4km to select mutually exclusive sites where reefs 
cannot be more distant than 4km. The choice of 4km was informed by a 
3-year study of the spatial movement patterns of artisanal coral reef fishers, 
corresponding to the highest density of fishing activities on reefs based on 
GPS-derived effort density maps of artisanal coral reef fishing activities*'. 
This clustering analysis was carried out using the R functions hclust and 
cutree, resulting in an average of 2.7 reefs per site. 

(iii) Nation/state (nation, state, or territory; n= 46). A larger scale in our analysis 
was nation/state, which are jurisdictions that generally correspond to indi- 
vidual nations (but could also include states, territories, overseas regions, or 
extremely remote areas within a state such as the northwest Hawaiian Islands; 
Extended Data Table 2), within which sites and reefs were nested for analysis. 


Estimating biomass. Reef fish biomass can reflect a broad selection of reef fish 
functioning and benthic conditions!*“?"4, and is a key metric of resource availabil- 
ity for reef fisheries. Reef fish biomass estimates were based on instantaneous visual 
counts from 6,088 surveys collected from 2,514 reefs. All surveys used standard 
belt-transects, distance sampling, or point-counts, and were conducted between 
2004 and 2013. Where data from multiple years were available from a single reef, 
we included only data from the year closest to 2010. Within each survey area, reef 
associated fishes were identified to species level, abundance counted, and total 
length (TL) estimated, with the exception of one data provider who measured 
biomass at the family level. To make estimates of biomass from these transect-level 
data comparable among studies, we: 


(iv) Retained families that were consistently studied and were above a mini- 
mum size cut-off. Thus, we retained counts of >10-cm diurnally active, 
non-cryptic reef fish that are resident on the reef (20 families, 774 species), 
excluding sharks and semi-pelagic species. We also excluded three groups 
of fishes that are strongly associated with coral habitat conditions and are 
rarely targets for fisheries (Anthiinae, Chaetodontidae, and Cirrhitidae). 
Families included are: Acanthuridae, Balistidae, Diodontidae, Ephippidae, 
Haemulidae, Kyphosidae, Labridae, Lethrinidae, Lutjanidae, Monacanthi- 
dae, Mullidae, Nemipteridae, Pinguipedidae, Pomacanthidae, Serranidae, 
Siganidae, Sparidae, Synodontidae, Tetraodontidae and Zanclidae. We 
calculated the total biomass of fish on each reef using standard published 
species-level length-weight relationship parameters or those available on 
FishBase**. When length—weight relationship parameters were not available 
for a species, we used the parameters for a closely related species or genus. 

(v) Directly accounted for depth and habitat as covariates in the model (see 
Environmental conditions section below). 

(vi) Accounted for any potential bias among data providers (capturing informa- 
tion on both inter-observer differences, and census methods) by including 
each data provider as a random effect in our model. 


Biomass means, medians, and standard deviations were calculated at the reef- 
scale. All reported log values are the natural log. 
Social drivers 
Local population growth. We created a 100 km buffer around each site and used 
this to calculate human population within the buffer in 2000 and 2010 based 
on the Socioeconomic Data and Application Centre (SEDAC) gridded popula- 
tion of the world database*®. Population growth was the proportional difference 
between the population in 2000 and 2010. We chose a 100 km buffer as a reasonable 
range at which many key human impacts from population (for example, land-use 
and nutrients) might affect reefs*”. 
Management. For each site, we determined if it was unfished, that is, whether it 
fell within the borders of a no-take marine reserve (we asked data providers to 
further classify whether the reserve had high or low levels of compliance); 
restricted—whether there were active restrictions on gears (for example, bans on 
the use of nets, spear guns, or traps) or fishing effort (which could have included 
areas inside marine parks that were not necessarily no take); or fished, that is, reg- 
ularly fished without effective restrictions. To determine these classifications, we 


used the expert opinion of the data providers, and triangulated this with a global 
database of marine reserve boundaries**. 

Gravity. We adapted the economic geography concept of ‘gravity’!7°°~*1, also called 
interactance™, to examine potential interactions between reefs and: (i) major urban 
centres/markets (defined as provincial capital cities, major population centres, 
landmark cities, national capitals, and ports); and (ii) the nearest human settle- 
ments. This application of the gravity concept infers that potential interactions 
increase with population size, but decay exponentially with the effective distance 
between two points. Thus, we gathered data on both population estimates and a 
surrogate for distance: travel time. 

Population estimations. We gathered population estimates for: (i) the nearest major 
markets (which includes national capitals, provincial capitals, major population 
centres, ports, and landmark cities) using the World Cities base map from ESRI; 
and (ii) the nearest human settlement within a 500 km radius using LandScan 2011 
database. The different data sets were required because the latter is available in 
raster format while the former is available as point data. We chose a 500 km radius 
from the nearest settlement as the maximum distance any non-market fishing 
activities for fresh reef fish are likely to occur. 

Travel time calculation. Travel time was computed using a cost-distance algorithm 
that computes the least ‘cost’ (in minutes) of travelling between two locations on 
a regular raster grid. In our case, the two locations were either the centroid of the 
site (that is, reef cluster) and the nearest settlement, or the centroid of the site and 
the major market. The cost (that is, time) of travelling between the two locations 
was determined by using a raster grid of land cover and road networks with the 
cells containing values that represent the time required to travel across them’: 


e Tree cover, broadleaved, deciduous and evergreen, closed; regularly flooded 
tree cover, shrub, or herbaceous cover (fresh, saline, & brackish water) = speed 
of lkmh7! 

¢ Tree cover, broadleaved, deciduous, open (open = 15-40% tree cover) = speed 
of 1.25kmh7! 

e Tree cover, needle-leaved, deciduous and evergreen, mixed leaf type; shrub 
cover, closed-open, deciduous and evergreen; herbaceous cover, closed-open; 
cultivated and managed areas; mosaic: cropland/tree cover/other natural veg- 
etation, cropland/shrub or grass cover = speed of 1.5kmh =! 

e Mosaic: tree cover/other natural vegetation; tree cover, burnt= speed of 
1.25kmh"! 

+ Sparse herbaceous or sparse shrub cover = speed of 2.5kmh =! 

- Water= speed of 20kmh~! 

¢ Roads=speed of 60kmh7! 

+» Track=speed of 30kmh ! 

- Artificial surfaces and associated areas = speed of 30kmh7! 

+ Missing values = speed of 1.4kmh =! 


We termed this raster grid a friction-surface (with the time required to travel 
across different types of surfaces analogous to different levels of friction). To 
develop the friction-surface, we used global data sets of road networks, land cover, 
and shorelines: 


e Road network data was extracted from the Vector Map Level 0 (VMap0) from 
the National Imagery and Mapping Agency’s (NIMA) Digital Chart of the 
World (DCW). We converted vector data from VMap0 to 1km resolution 
raster. 

e Land cover data were extracted from the Global Land Cover 2000 (ref. 44). 

e To define the shorelines, we used the GSHHS (Global Self-consistent, 
Hierarchical, High-resolution Shoreline) database version 2.2.2. 


These three friction components (road networks, land cover, and water bodies) 
were combined into a single friction surface with a Behrmann map projection. 
We calculated our cost-distance models in R* using the accCost function of the 
gdistance package. The function uses Dijkstra’s algorithm to calculate least-cost 
distance between two cells on the grid and the associated distance taking into 
account obstacles and the local friction of the landscape“. Travel time estimates 
over a particular surface could be affected by the infrastructure (for example, road 
quality) and types of technology used (for example, types of boats). These types 
of data were not available at a global scale but could be important modifications 
in more localized studies. 

Gravity computation. To compute the gravity to the nearest market, we calculated 
the population of the nearest major market and divided that by the squared travel 
time between the market and the site. Although other exponents can be used?’, we 
used the squared distance (or in our case, travel time), which is relatively common 
in geography and economics. This decay function could be influenced by local 
considerations, such as infrastructure quality (for example, roads), the types of 
transport technology (that is, vessels being used), and fuel prices, which were not 
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available in a comparable format for this global analysis, but could be important 
considerations in more localized adaptations of this study. 

To determine the gravity of the nearest settlement, we located the nearest pop- 
ulated pixel within 500 km, determined the population of that pixel, and divided 
that by the squared travel time between that cell and the reef site. 

As is standard practice in many agricultural economics studies*®, an assumption 
in our study is that the nearest major capital or landmark city represents a market. 
Ideally we would have used a global database of all local and regional markets 
for coral reef fish, but this type of database is not available at a global scale. As a 
sensitivity analysis to help justify our assumption that capital and landmark cities 
were a reasonable proxy for reef fish markets, we tested a series of candidate models 
that predicted biomass based on: (1) cumulative gravity of all cities within 500 km; 
(2) gravity of the nearest city; (3) travel time to the nearest city; (4) population of 
the nearest city; (5) gravity to the nearest human population above 40 peoplekm? 
(assumed to be a small peri-urban area and potential local market); (6) the travel 
time between the reef and a small peri-urban area; (7) the population size of the 
small peri-urban population; (8) gravity to the nearest human population above 
75 people km? (assumed to be a large peri-urban area and potential market); 
(9) the travel time between the reef and this large peri-urban population; (10) the 
population size of this large peri-urban population; and (11) the total population 
size within a 500 km radius. Model selection revealed that the best two models 
were gravity of the nearest city and gravity of all cities within 500 km (with a 3 
AIC value difference between them; Extended Data Table 3). Importantly, when 
looking at the individual components of gravity models, the travel time compo- 
nents all had a much lower AIC value than the population components, which is 
broadly consistent with previous systematic review studies’. Similarly, travel time 
to the nearest city had a lower AIC score than any aspect of either the peri-urban 
or urban measures. This suggests our use of capital and landmark cities is likely 
to better capture exploitation drivers from markets rather than simple popula- 
tion pressures. This may be because market dynamics are difficult to capture by 
population threshold estimates; for example some small provincial capitals where 
fish markets are located have very low population densities, while some larger 
population centres may not have a market. Downscaled regional or local analyses 
could attempt to use more detailed knowledge about fish markets, but we used the 
best proxy available at a global scale. 

Human Development Index (HDI). HDI is a summary measure of human devel- 
opment encompassing: a long and healthy life, being knowledgeable, and having 
a decent standard of living. In cases where HDI values were not available specific 
to the State (for example, Florida and Hawaii), we used the national (for example, 
USA) HDI value. 

Population size. For each nation/state, we determined the size of the human pop- 
ulation. Data were derived mainly from census reports, the CIA fact book, and 
Wikipedia. 

Tourism. We examined tourist arrivals relative to the nation/state population 
size (above). Tourism arrivals were gathered primarily from the World Tourism 
Organization's Compendium of Tourism Statistics. 

National reef fish landings. Catch data were obtained from the Sea Around Us 
Project (SAUP) catch database (http://www.seaaroundus.org), except for Florida, 
which was not reported separately in the database. We identified 200 reef fish 
species and taxon groups in the SAUP catch database*’. Note that reef-associated 
pelagics such as scombrids and carangids normally form part of reef fish catches. 
However, we chose not to include these species because they are also targeted and 
caught in large amounts by large-scale, non-reef operations. 

Voice and accountability. This metric, from the World Bank survey on governance, 
reflects the perceptions of the extent to which a country’s citizens are able to par- 
ticipate in selecting their government, as well as freedom of expression, freedom of 
association, and a free media. In cases where governance values were not available 
specific to the nation/state (for example, Florida and Hawaii), we used national 
(for example, USA) values. 

Environmental drivers 

Depth. The depth of reef surveys were grouped into the following categories: <4m, 
4-10m, >10m to account for broad differences in reef fish community structure 
attributable to a number of inter-linked depth-related factors. Categories were 
necessary to standardise methods used by data providers and were determined by 
pre-existing categories used by several data providers. 

Habitat. We included the following habitat categories: 


(i) Slope. The reef slope habitat is typically on the ocean side of a reef, where the 
reef slopes down into deeper water. 

(ii) Crest. The reef crest habitat is the section that joins a reef slope to the reef flat. 
The zone is typified by high wave energy (that is, where the waves break). It 
is also typified by a change in the angle of the reef from an inclined slope to 
a horizontal reef flat. 
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(iii) Flat. The reef flat habitat is typically horizontal and extends back from the 
reef crest for 10’s to 100’s of metres; 

(iv) Lagoon/back reef. Lagoon reef habitats are where the continuous reef flat 
breaks up into more patchy reef environments sheltered from wave energy. 
These habitats can be behind barrier/fringing reefs or within atolls. Back 
reef habitats are similar broken habitats where the wave energy does not 
typically reach the reefs and thus forms a less continuous ‘lagoon style’ reef 
habitat. Due to minimal representation among our sample, we excluded 
other less prevalent habitat types, such as channels and banks. To verify 
the sites’ habitat information, we used the Millennium Coral Reef Map- 
ping Project (MCRMP) hierarchical data®!, Google Earth, and site depth 
information. 


Productivity. We examined ocean productivity for each of our sites in mg of C per 
m” per day (http://www.science.oregonstate.edu/ocean.productivity/). Using the 
monthly data for years 2005 to 2010 (in hdf format), we imported and converted 
those data into ArcGIS. We then calculated yearly average and finally an average 
for all these years. We used a 100 km buffer around each of our sites and examined 
the average productivity within that radius. Note that ocean productivity esti- 
mates are less accurate for near-shore environments, but we used the best available 
data. 

Analyses. We first looked for collinearity among our covariates using bivariate 
correlations and variance inflation factor estimates (Extended Data Fig. 2 and 
Extended Data Table 4). This led to the exclusion of several covariates (not 
described above): (i) geographic basin (tropical Atlantic, western Indo-Pacific, 
central Indo-Pacific, or eastern Indo-Pacific); (ii) gross domestic product (purchas- 
ing power parity); (iii) rule of law (World Bank governance index); (iv) control of 
corruption (World Bank governance index); and (v) sedimentation. Additionally, 
we removed an index of climate stress, developed by Maina et al.*?, which incor- 
porated 11 different environmental conditions, such as the mean and variability 
of sea surface temperature due to repeated lack of convergence for this parameter 
in the model, likely indicative of unidentified multicollinearity. All other covar- 
iates had correlation coefficients 0.7 or less and variance inflation factor scores 
less than 5 (indicating multicollinearity was not a serious concern). Care must be 
taken in causal attribution of covariates that were significant in our model, but 
demonstrated collinearity with candidate covariates that were removed during the 
aforementioned process. Importantly, the covariate that exhibited the largest effect 
size in our model, market gravity, was not strongly collinear with other candidate 
covariates. 

To quantify the multi-scale social, environmental, and economic factors affect- 
ing reef fish biomass we adopted a Bayesian hierarchical modelling approach that 
explicitly recognized the three scales of spatial organization: reef (j), site (k), and 
nation/state (s). 

In adopting the Bayesian approach we developed two models for inference: 
a null model, consisting only of the hierarchical units of observation (that is, 
intercepts-only) and a full model that included all of our covariates (drivers) of 
interest. Covariates were entered into the model at the relevant scale, leading 
to a hierarchical model whereby lower-level intercepts (averages) were placed 
in the context of higher-level covariates in which they were nested. We used 
the null model as a baseline against which we could ensure that our full model 
performed better than a model with no covariate information. We did not 
remove ‘non-significant’ covariates from the model because each covariate was 
carefully considered for inclusion and could therefore reasonably be considered 
as having an effect, even if small or uncertain; removing factors from the model 
is equivalent to fixing parameter estimates at exactly zero—a highly-subjective 
modelling decision after covariates have already been selected as potentially 
important>’. 

The full model assumed the observed, reef-scale observations of fish biomass 
(vijzs) were modelled using a non-central ¢ distribution, allowing for fatter tails than 
typical log-normal models of reef fish biomass**. We chose the non-central t after 
having initially used a log-normal model because our model diagnostics suggested 
that several model parameters had not converged. We ran a supplementary analysis 
to support our use of the non-central f distribution with 3.5 degrees of freedom 
(see Supplementary Information). Therefore our model was: 


logly;xs] ~ non-central t(Lijess Treefs 3.5) 
Maks = Bojts + PreetXreet 
Treef ~ U(0,100)-? 


with X;eep representing the matrix of observed reef-scale covariates and (yeep array 
of estimated reef-scale parameters. The Tyee (and all subsequent 7 values) were 
assumed common across observations in the final model and were minimally 
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informative®. Using a similar structure, the reef-scale intercepts (Gojks) Were struc- 
tured as a function of site-scale covariates (Xit): 


Bojks oN N(Ljps Tit) 
Hiks = Yoks ate VeiXsit 
Tit ~ U(0,100)-? 


with 7.j¢ representing an array of site-scale parameters. Building upon the hier- 
archy, the site-scale intercepts (yox;) were structured as a function of state-scale 
covariates (X¢tq): 


Yoks ™ N( flys Tata) 
bs = Yor VetaXsta 
Tata ~ U(0,100)-? 


Finally, at the top scale of the analysis we allowed for a global (overall) estimate of 
average log-biomass (yo): 


Yo ~ N(0.0,1000) 


The relationships between fish biomass and reef, site, and state-scale drivers 

was carried out using the PyMC package™ for the Python programming language, 
using a Metropolis-Hastings (MH) sampler run for 10° iterations, with a 900,000 
iteration burn-in thinned by 10, leaving 10,000 samples in the posterior distribu- 
tion of each parameter; these long burn-in times are often required with a com- 
plex model using the MH algorithm. Convergence was monitored by examining 
posterior chains and distributions for stability and by running multiple chains 
from different starting points and checking for convergence using Gelman-Rubin 
statistics”? for parameters across multiple chains; all were at or close to 1, indicating 
good convergence of parameters across multiple chains. 
Overall model fit. We conducted posterior predictive checks for goodness of fit 
(GoF) using Bayesian P values*? (BpV), whereby fit was assessed by the discrep- 
ancy between observed or simulated data and their expected values. To do this we 
simulated new data (y;"°”) by sampling from the joint posterior of our model (6) 
and calculated the Freeman-Tukey measure of discrepancy for the observed (y;°"’) 
or simulated data, given their expected values (;1;): 


DIA) =D (WH — JH? 
t 

yielding two arrays of median discrepancies D(y°'S/@) and D(y"*"/0) that were 
then used to calculate a BpV for our model by recording the proportion of times 
D(y°S/0) was greater than D(y"*”/0) (Extended Data Fig. 3a). A BpV above 0.975 
or under 0.025 provides substantial evidence for lack of model fit. Evaluated by 
the deviance information criterion (DIC), the full model greatly outperformed a 
null model that included no covariates (ADIC = 472). 

To examine homoscedasticity, we checked residuals against fitted values. We 

also checked the residuals against all covariates included in the model, and several 
covariates that were not included in the model (primarily due to collinearity), 
including: (i) Atoll, a binary metric of whether the reef was on an atoll or not; 
(ii) control of corruption, perceptions of the extent to which public power is exer- 
cised for private gain, including both petty and grand forms of corruption, as well 
as ‘capture’ of the state by elites and private interests, derived from the World Bank 
survey on governance; (iii) geographic basin, whether the site was in the tropi- 
cal Atlantic, western Indo-Pacific, central Indo-Pacific, or eastern Indo-Pacific; 
(iv) connectivity, we examined three measures based on the area of coral reef 
within a 30km, 100 km, and 600 km radius of the site; (v) sedimentation; (vi) coral 
cover (which was only available for a subset of the sites); (vii) climate stress*’; and 
(viii) census method. The model residuals showed no patterns with these eight 
additional covariates, suggesting they would not explain additional information 
in our model. 
Bright and dark spot estimates. Because the performance of site scale locations 
are of substantial interest in uncovering novel solutions for reef conservation, we 
defined bright and dark spots at the site scale. To this end, we defined bright (or 
dark) spots as locations where expected site-scale intercepts (Yoxs) differed by more 
than two standard deviations from their nation/state-scale expected value (/1,;), 
given all the covariates present in the full hierarchical model: 


SSspot = | (tgs — Vors)| > 218-4. (Hy; — Yors)] 


This, in effect, probabilistically identified the most deviant sites, given the 
model, while shrinking sites towards their group-level means, thereby allowing 


us to overcome potential bias due to low and varying sample sizes that can lead to 
extreme values from chance alone. After an initial log-normal model formulation, 
where we were not confident in model convergence, we employed a non-central 
t distribution at the observation scale, which facilitated model convergence and 
dampened any effects of potentially extreme reef-scale observations on the bright 
and dark spot estimates. Further, we did not consider a site a bright or dark spot if 
the group-level (that is, nation/state) mean included fewer than five sites. 
Analysing conditions at bright spots. For our preliminary exploration into why 
bright and dark spots may diverge from expectations, we surveyed data providers 
and other experts about key social, institutional, and environmental conditions 
at the 15 bright spots, 35 dark spots, and 14 sites that performed most closely to 
model specifications. Specifically, we developed an online survey (SI) using Survey 
Monkey (http://www.surveymonkey.com) software, which we asked data providers 
who sampled those sites to complete with input from local experts, where neces- 
sary. Data providers generally filled in the survey in consultation with nationally 
based field team members who had detailed local knowledge of the socioeconomic 
and environmental conditions at each of the sites. Research on bright spots in 
agricultural development’ highlights several types of social and environmental 
conditions that may lead to bright spots, which we adapted and developed proxies 
for as the basis of our survey into why our bright and dark spots may diverge from 
expectations. These include: 


(i) Social and institutional conditions. We examined the presence of custom- 
ary management institutions such as taboos and marine tenure institutions, 
whether there was substantial engagement by local people in management 
(specifically defined as there being active engagement by local people in reef 
management decisions; token involvement and consultation were not consid- 
ered substantial engagement), and whether there were high levels of depend- 
ence on marine resources (specifically, whether a majority of local residents 
depend on reef fish as a primary source of food or income). All social and 
institutional conditions were converted to presence/absence data. Depend- 
ence on resources and engagement were limited to sites that had adjacent 
human populations. All other conditions were recorded regardless of whether 
there is an adjacent community. 

(ii) Technological use/innovation. We examined the presence of motorized ves- 
sels, intensive capture equipment (such as beach seine nets, surround gill 
nets, and muro-ami nets), and storage capacity (that is, freezers). 

(iii) External influences (such as donor-driven projects). We examined the pres- 
ence of NGOs, fishery development projects, development initiatives (such 
as alternative livelihoods), and fisheries improvement projects. All external 
influences were recorded as present/absent then summarized into a single 
index of whether external projects were occurring at the site. 

(iv) Environmental/ecological processes (for example, recruitment and con- 
nectivity). We examined whether sites were within 5 km of mangroves and 
deep-water refuges, and whether there had been any major environmental 
disturbances such as coral bleaching, tsunami, and cyclones within the past 
5 years. All environmental conditions were recorded as present/absent. 


As an exploratory analysis of associations between these conditions and 
whether sites diverged more or less from expectations, we used two complemen- 
tary approaches. The link between the presence/absence of the aforementioned 
conditions and whether a site was bright, average, or dark was assessed using a 
Fisher's exact test. Then we tested whether the mean deviation in fish biomass from 
expected was similar between sites with presence or absence of the mechanisms 
in question (that is, the presence or absence of marine tenure/taboos) using an 
ANOVA assuming unequal variance. The two tests yielded similar results, but 
provide slightly different ways to conceptualize the issue, the former is correlative 
while the latter explains deviation from expectations based on conditions, so we 
provide both (Fig. 3 and Extended Data Fig. 4). It is important to note that some 
of these social and environmental conditions were significantly associated (that 
is, Fisher's exact probabilities <0.05), and further research is required to uncover 
how these and other conditions may make sites bright or dark. 
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Extended Data Figure 1 | Marginal relationships between reef fish 
biomass and social drivers. a, Local population growth; b, market 
gravity; c, nearest settlement gravity; d, tourism; e, nation/state 
population size; f, Human Development Index; g, high compliance 
marine reserve (0 is fished baseline); h, restricted fishing (0 is fished 
baseline); i, low-compliance marine reserve (0 is fished baseline); j, voice 
and accountability; k, reef fish landings; 1, ocean productivity; m, depth 
(—1=0-4m, 0=4-10m, 1=>10m);n, reef flat (0 is reef slope baseline); 
o, reef crest flat (0 is reef slope baseline); p, lagoon/back reef flat (0 is reef 


slope baseline). All variables displayed on the x axis are standardized. Red 
lines are the marginal trend line for each parameter as estimated by the full 
model. Grey lines are 100 simulations of the marginal trend line sampled 
from the posterior distributions of the intercept and parameter slope, 
analogous to conventional confidence intervals. Two asterisks indicate that 
95% of the posterior density is in either a positive or negative direction 
(Fig. 1b-d); a single asterisk indicates that 75% of the posterior density is 
in either a positive or negative direction. 
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Parents Data Figure 2 ; Garcitcn plot of candidate continuous covariates before accounting for aaa (Extended Data Table 4). 
Collinearity between continuous and categorical covariates (including biogeographic region, habitat, protection status, and depth) were analysed using 
box plots. 
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Extended Data Figure 3 | Model fit statistics. Top, Bayesian P values (BpV) and the data. Bottom, Posterior distribution for the degrees of freedom 
for the full model indicating goodness of fit, based on posterior discrepancy. parameter (Vv) in our supplementary analysis of candidate distributions. The 


Points are Freeman-Tukey differences between observed and expected highest posterior density of 3.46, with 97.5% of the total posterior density 
values, and simulated and expected values within the MCMC scheme below 4 provides strong evidence in favour of a non-central t distribution 
(n= 10,000). Plot shows no evidence for lack of fit between the model relative to a normal distribution and supports the use of 3.5 for v. 
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Extended Data Table 1 | 


Covariate 


Local population growth 


‘Gravity’ of major markets 


within 500km 


‘Gravity’ of the closest 


human settlement 


Protection status 


Human Development 


index 


Population Size 


Tourism 


Voice and accountability 


Fish landings 


Productivity 


Habitat 


Depth 


Summary of social and environmental covariates 


Description 


Difference in local human population (i.e. 100km buffer 


around our sites) between 2000-2010 


The population of the major market divided by the squared 
travel time between the reef sites and the market. This value 


was summed for all major markets within 500km of the site. 


The population of the nearest human settlement divided by 
the squared travel time between the reef site and the 


settlement. 


Whether the reef is openly fished, restricted (e.g. effective 


gear bans or effort restrictions), or unfished 


A summary measure of human development encompassing: 
a long and healthy life, being knowledgeable and have a 
decent standard of living. We used linear and quadratic 


functions for HDI. 


Total population size of the jurisdiction 


Proportion of tourist visitors to residents 


Perceptions of the extent to which a country's citizens are 


able to participate in selecting their government. 


Landings of reef fish (tons) per Km? of reef 


The average (2005-2010) ocean productivity in mg C / m2 / 
day 


Whether the reef is a slop, crest, flat, or back reef/lagoon 


Depth of the ecological survey (<4m, 4.1-10m, >10m) 


Scale 


Site 


Site 


Site 


Reef 


Nation/state 


Nation/ 


state 


Nation/ 


state 


Nation/ 


state 


Nation/ 


state 


Site 


Reef 


Reef 


Key data sources 

Socioeconomic Data and Application Centre (GEDAC) 
gridded population of the work database*6 

Human population size, land cover, road networks, 


coastlines 


Human population size, land cover, road networks, 


coastlines 


Expert opinion, global map of marine protected areas. 


United Nations Development Programme 


World Bank, census estimates, Wikipedia 


World Tourism Organization’s Compendium of Tourism 


Statistics, census estimates 


World Bank 


Teh et al.5° 


http://www.science.oregonstate.edu/ocean.productivity/ 


Primary data 


Primary data 


Further details can be found in the Methods. The smallest scale is the individual reef. Sites consist of clusters of reefs within 4 km of each other. Nations/states generally correspond to countries, but 
can also include or territories or states, particularly when geographically isolated (for example, Hawaii). Refs 36 and 50 are cited in this table. 
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Extended Data Table 2 | List of nations/states covered in study and their respective average biomass (kg ha"!+ standard error) 


Nation/states 


Average biomass 


American Samoa 235.93 
Australia 735.01 
Belize 981.16 
Brazil 663.35 
British Indian Ocean Territory (Chagos) 2975.58 
Cayman Islands 464.09 
Colombia 846.07 
Commonwealth of the Northern Mariana Islands 505.54 
Comoros Islands 305.62 
Cuba 2107.37 
Egypt 552.73 
Farquhar 2665.48 
Federated States of Micronesia 377.90 
Fiji 1464.54 
Florida 1661.35 
French Polynesia 1077.20 
Guam 118.98 
Hawaii 380.45 
Indonesia 275.76 
Israel 445.16 
Jamaica 275.77 
Kenya 335.25 
Kiribati 1219.93 
Madagascar 409.48 
Maldives 688.64 
Marshall Islands 707.72 
Mauritius 166.93 
Mayotte 631.43 
Mexico 1930.81 
Mozambique 461.01 
Netherlands Antilles 428.01 
New Caledonia 1460.27 
NW Hawaiian Islands 729.71 
Oman 282.79 
Palau 3212.26 
Panama 373.78 
Papua New Guinea 566.70 
Philippines 202.62 
Pacific Remote Island Areas (PRIA), USA 641.47 
Reunion 172.32 
Seychelles 446.99 
Solomon Islands 1280.30 
Tanzania 346.29 
Tonga 1149.97 
United Arab Emirates 81.35 
Venezuela 1472.39 


@ SE) 


(£17.75) 
(+ 136.85) 
(+ 65.32) 
(£115.17) 
(+ 603.99) 
(+ 25.41) 
(+ 162.49) 
(+ 99.3) 
(+ 38.73) 
(+ 466.34) 
(+ 70.18) 
(+ 492.62) 
NA (n=1) 
(+ 144.39) 
(+ 198.42) 
(+ 101.4) 
(+ 16.81) 
(+ 25.11) 
(+ 19.89) 
(+ 105.13) 
(4 50.75) 
(+ 65.81) 
(4 93.2) 
(+ 46.1) 
(+ 97.07) 
(+ 174.38) 
(+ 73.7) 
(+ 68.25) 
(+ 737.09) 
(+ 60.14) 
(+ 53.99) 
(+ 143.18) 
(+ 46.33) 
(+ 70.22) 
(4 332.02) 
(+ 85.41) 
(431.76) 
NA (n=1) 
(4 79.25) 
(+ 30.67) 
(+ 46.6) 
(4216.74) 
(441.51) 
(+ 151.27) 
(+ 28.66) 
(+ 496.95) 


In most cases, nation/state refers to an individual country, but can also include states (for example, Hawaii or Florida), territories (for example, British Indian Ocean Territory), or other jurisdictions. 
We treated the northwestern Hawaiian islands and Farquhar as separate ‘nation/states’ from Hawaii and the Seychelles, respectively, because they are extremely isolated and have little or no human 
population. In practical terms, this meant different values for a few nation/state scale indicators that ended up having relatively small effect sizes (Fig. 1b): population, tourism visitations, and in the 
case of the northwestern Hawaiian islands, fish landings. 
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Extended Data Table 3 | Model selection of potential gravity indicators and components 


Model Covariates Alc Delta AIC 
M2 Gravity of nearest city 2666.4 0 
M1 Gravity of all cities in 500km 2669.5 3.1 
M3 Travel time to nearest city 2700.0 33.6 
M5 Gravity of nearest small peri-urban area (40 people/km2) 2703.9 37.5 
M11 Total Population in 500km radius 2712.0 45.6 
M9 Travel time to the nearest large peri-urban area (75 people/km2) = 2712.1 45.7 
M6 Travel time to nearest small peri-urban area (40 people/km2) 2713.8 47.4 
M8 Gravity to the nearest large peri-urban area (75 people/km2) 2722.9 56.5 
M7 Population of nearest small peri-urban area (40 people/km2) 2792.7 126.3 
M4 Population of the nearest city 2812.8 146.5 
M10 Population of the nearest large peri-urban area (75 people/km2) 2822.2 155.8 
Mo Intercept only 2827.7 161.27 
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Extended Data Table 4 | Variance inflation factor (VIF) scores for continuous data before and after removing variables due to collinearity 


Covariate 


Market gravity (log) 


Nearest settlement gravity 


Population growth 
Climate stress 
Ocean productivity 
Sedimentation 
Tourism 

Control Corruption 
GDP 

HDI 

Population size 
Reef fish landings 
Rule of Law 


Voice and Accountability 


X=covariate removed. 


starting VIF 


1.9 
1.4 
1.4 
2.7 
6.5 
6.0 
2.5 
10.5 
8.2 
5.5 
1.9 
3.1 
33.8 
3.2 


ending VIF 


15 
1.3 
1.3 

X 
2.2 


3.2 
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Extended Data Table 5 | List of bright and dark spot locations, population status, and protection status 


even Eee Nation/State Location Populated Protection 
British Indian Ocean Territory Chagos Unpopulated pola (high 
compliance) 
Commonwealth of the Northern Mariana Agrihan Unpopulated Fished 
Islands Guguan Unpopulated Fished 
Raja Ampat 1 Populated Restricted 
Indonesia Raja Ampat 2 Populated Restricted 
Kalimantan Populated Restricted 
; By es Tabueran 1 Populated Fished 
Bright Feria Tabueran 2 Populated Fished 
Papua New Guinea Karkar Populated Restricted 
PRIA Baker Unpopulated Restricted 
Jarvis Island Unpopulated Restricted 
Choiseul Populated Fished 
Soloman lelands Isabel Populated Fished 
Makira Populated Fished 
New Georgia Populated Fished 
Australia Lord Howe Populated elated 
Hawaii Populated Fished 
Kauai 1 Populated Fished 
Kauai 2 Populated Fished 
Lanai Populated Fished 
Maui 1 Populated Fished 
Maui 2 Populated Fished 
Hawaii Molokai Populated Fished 
Oahu 1 Populated Fished 
Oahu 2 Populated Fished 
Oahu 3 Populated Fished 
Oahu 4 Populated Fished 
Oahu 5 Populated Fished 
Oahu 6 Populated Fished 
Karimunjawa 1 Populated Fished 
Indonesia Karimunjawa 2 Populated Unfished (low compliance) 
Karimunjawa 3 Populated Unfished (low compliance) 
Pulau Aceh Populated Fished 
Montego Bay 1 Populated Unfished (low compliance) 
Dark Jamaica Montego Bay 2 Populated Fished 
Rio Bueno Populated Fished 
Kenya Diani Populated Fished 
Madagascar Toliara Populated Fished 
Mauritius Anse Raie Populated Fished 
Grand Sable Populated Fished 
Lisianski Unpopulated se aed ad 
NW Hawaii Pearl & Hermes 1 Unpopulated Uintiehes thigh 
compliance) 
Pearl & Hermes 2 Unpopulated pilelies (high 
compliance) 
Reunion Reunion Populated Fished 
Seychelles Bel Ombre Populated Restricted 
Bongoyo Populated pa 
Tansania Chapwani Populated Fished 
Mtwara Populated Fished 
ae ee Populated Fished 
Venezuela Chuspa Populated Fished 
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Prefrontal neuronal assemblies temporally control 


fear behaviour 


Cyril Dejean!*, Julien Courtin!*3*, Nikolaos Karalis!*4*, Fabrice Chaudun!?, Héléne Wurtz!*, Thomas C. M. Bienvenu? 


& Cyril Herry!? 


Precise spike timing through the coordination and synchronization 
of neuronal assemblies is an efficient and flexible coding mechanism 
for sensory and cognitive processing! In cortical and subcortical 
areas, the formation of cell assemblies critically depends on 
neuronal oscillations, which can precisely control the timing of 
spiking activity”*®. Whereas this form of coding has been described 
for sensory processing and spatial learning” |”, its role in encoding 
emotional behaviour remains unknown. Fear behaviour relies on the 
activation of distributed structures, among which the dorsal medial 
prefrontal cortex (dmPFC) is known to be critical for fear memory 
expression!?-!°, In the dmPFC, the phasic activation of neurons to 
threat-predicting cues, a spike-rate coding mechanism, correlates 
with conditioned fear responses and supports the discrimination 
between aversive and neutral stimuli!*!”~!°. However, this 
mechanism does not account for freezing observed outside stimuli 
presentations, and the contribution of a general spike-time coding 
mechanism for freezing in the dmPFC remains to be established. 
Here we use a combination of single-unit and local field potential 
recordings along with optogenetic manipulations to show that, in 
the dmPFC, expression of conditioned fear is causally related to 
the organization of neurons into functional assemblies. During 
fear behaviour, the development of 4 Hz oscillations coincides with 
the activation of assemblies nested in the ascending phase of the 
oscillation. The selective optogenetic inhibition of dmPFC neurons 
during the ascending or descending phases of this oscillation blocks 
and promotes conditioned fear responses, respectively. These 
results identify a novel phase-specific coding mechanism, which 
dynamically regulates the development of dmPFC assemblies to 
control the precise timing of fear responses. 

To evaluate the contribution of prefrontal spike-time coding 
mechanisms to fear behaviour, mice were implanted with recording 
electrodes targeting the dmPFC and submitted to discriminative fear 
conditioning (Fig. la). In this behavioural model, mice learned to 
discriminate between two auditory stimuli of different frequency. The 
conditioned stimulus termed CS? is associated with the delivery of a 
mild foot-shock (the unconditioned stimulus, US) whereas a control 
stimulus, termed CS~, is not. Twenty-four hours after conditioning, 
when re-exposed to the CS* but not the CS”, mice displayed a selective 
increase in conditioned freezing, a characteristic fear immobiliza- 
tion reaction (Fig. la). Freezing was not only driven by CS sensory 
stimulations but was also observed between CS presentations, suggest- 
ing the involvement of internally generated mechanisms triggering/ 
maintaining freezing responses (Extended Data Fig. 1). Interestingly, 
during freezing, a subset of putative principal neurons (PNs) 
synchronized briefly and repeatedly, which represents a main charac- 
teristic of neuronal assemblies. To identify whether the formation of 
dmPFC assemblies may encode freezing, we investigated the presence 
of freezing-specific coactivation patterns in simultaneously recorded 


dmPFC PNs (Extended Data Fig. 2). We calculated correlation matrices 
between spike trains of dmPFC PNs within a sliding window of 
150 ms and performed principal component analyses on these 
correlation matrices (see Methods and Extended Data Fig. 3). In each 
animal, these analyses identified a discrete pattern of correlation that 
was freezing specific. This pattern corresponded to a subpopulation of 
PNs repeatedly recruited during freezing (Fig. lb-d and Extended Data 
Fig. 3g). These dmPFC PNs displayed enhanced firing and coactivation 
each time the correlation pattern occurred, hence forming assemblies of 
PNs. These assembly neurons (ANs) were functionally segregated from 
ONs as coactivation between those two populations was not different 
from chance (Fig. 1d-f and Extended Data Fig. 3). Importantly, AN 
firing rate was stable during freezing, indicating that assembly detection 
was not due to tonic increase in firing (Fig. 1c and Extended Data 
Fig. 4a). During CS* presentations, the firing profile of dmPFC neurons 
was highly heterogeneous, with AN and ON exhibiting undifferen- 
tiated CS-evoked excitation, inhibition or no responses, as recently 
observed'*'” (Extended Data Fig. 4c—f). In addition, the probability 
of observing assembly activation during freezing was similar inside or 
outside CS* presentations and did not vary around CS* presentations 
(Extended Data Fig. 4b, g). Together, these data indicate that a specific 
subpopulation of dmPFC PNs participates in functional assemblies 
during fear expression. 

After conditioning, analyses of dmPFC local field potentials (LFPs) 
revealed a prominent oscillation in the 3-6 Hz range, with a peak 
around 4 Hz, which develops during freezing and temporally matched 
the recruitment of dmPFC assemblies (Fig. 2a—b). Consistent with 
our previous observations”, power spectral and spectrogram analyses 
indicated that dmPFC 4Hz oscillation power was significantly higher 
during freezing compared with mobility periods, and that freezing 
onset and offset coincided with an increase and decrease of 4 Hz 
power, respectively (Fig. 2c-d). Strikingly, frequency-specific cor- 
relational analyses revealed a significant and maximum correlation 
between dmPFC LFP 4 Hz power and assembly probability within 
epochs surrounding freezing onset and offset (Fig. 2e). To evaluate 
whether the development of 4 Hz oscillations and the formation of 
dmPEC assemblies was predictive of freezing, we computed the first 
time point around freezing onset and offset for which 4 Hz oscilla- 
tions and assembly probability significantly changed. These analyses 
indicated that both 4 Hz power and assembly activation significantly 
increased and decreased before freezing onset and offset, respectively 
(Fig. 2f). Importantly, the increase in 4 Hz power preceded the changes 
in assembly activation, which indicates that 4 Hz oscillations are a 
prerequisite process for dmPFC assembly activation and freezing onset. 
These data indicate that freezing dynamics are precisely controlled by 
a 4 Hz-mediated recruitment of dmPFC assemblies. 

To understand how 4 Hz oscillations dynamically shape dmPFC 
assemblies during freezing, we evaluated the phase locking of 
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Figure 1 | dmPFC PNs participate in neuronal assemblies during 
freezing. a, Protocol and behavioural results. During habituation, mice 
(n=7) exhibited low freezing during CS~ and CS". After conditioning 
(post-FC), CS* induced high freezing compared with CS~ (Friedman 
repeated-measures one-way analysis of variance (ANOVA) on ranks 

test, P < 0.01; Student-Newman-Keuls post-hoc test, CS~ versus each 
CS* block, all P < 0.05). b, Example of dmPFC ANs exhibiting temporal 
organization during freezing compared with no freezing and to ON. 

c, Left: representative example of assembly activation during freezing 
versus no freezing and averaged firing for AN (n = 16, black line; white 
shaded area, s.e.m.). Grey shaded area represents freezing periods. Right: 
average numbers of assemblies activated during no freezing (No freez.) 
and freezing (Freez.) (n =7 mice, Wilcoxon signed-rank test, 

no-freezing versus freezing, ***P < 0.001). d, Distribution of the assembly 
participation index (AP index) for 55 dmPFC PNs recorded in a mouse. 
The individual neuron AP index shows which PNs are significantly active 
(AN), or inhibited/unchanged (ON) within freezing patterns detected 

by principal component analysis (1, chance level). e, Coactivation matrix 
averaged over freezing epochs from the same 55 dmPFC PNs. The strength 
of the coactivation between neuron pairs is expressed as the percentage of 
coactivation compared with chance. f, Averaged coactivation for AN 
and/or ON pairs (one sample t-test, coactivation versus 100% hypothetical 
mean, P < 0.01; one-way ANOVA, P < 0.001; Bonferroni post-hoc test, 
AN x AN versus any other group, **P < 0.01). Shaded areas and error 
bars, mean +s.e.m. 


individual AN and ON to 4Hz oscillations (Fig. 3a, b). The fraction 
of dmPFC neurons significantly phase-locked to 4 Hz oscillations was 
larger for ANs than ONs (Fig. 3c). Remarkably, phase-locked ANs 
were highly selectivity of 4 Hz ascending phase, while the preferred 
phase distribution of ONs was homogenous (Fig. 3d, e). To control 
if ANs might only synchronize because of their 4 Hz phase locking 
or to 4 Hz power increase, we first evaluated the averaged pairwise 
co-firing activity between pairs of ANs or ONs as a function of 4 Hz 
phase. This co-firing activity was corrected for the temporal and phase 
relation between pairs of neurons using a shuffling and shift predictor 
approach (see Methods and Fig. 3f). The corrected co-firing activity 
of AN was significantly stronger in the ascending compared with the 
descending phase of 4 Hz oscillation, which rules out the possibility that 
AN synchronization is solely due to AN phase locking (Fig. 3f). Next, 
we computed the corrected co-firing of AN pairs as a function of 4Hz 
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Figure 2 | Temporal matching of 4 Hz oscillations and dmPFC 
assemblies during freezing. a, Examples of raw dmPFC LFP traces at 
the onset (top) and offset (bottom) of a freezing episode. b, Freezing 
episodes (top, black lines and grey boxes), LFP spectrogram in the 2-16 Hz 
frequency band (middle, log, scale), assembly activation and LFP power 
averaged in the 3-6 Hz band (bottom). c, Normalized LFP power spectra 
for freezing and no freezing in the 2-16 Hz band (n=7 mice; log» scale). 
Inset: 4 Hz to @ ratio for freezing and no freezing (n =7 mice, Wilcoxon 
signed-rank test, non-freezing versus freezing, *P < 0.05). d, Averaged 
peri-event spectrogram in the 2-16 Hz frequency band (log) scale) 
centred on freezing onset (left, n = 7 mice) and offset (right, n =7 mice). 
e, Correlation between power and assembly activation as a function of 
LFP frequency (log> scale) during epochs surrounding freezing onset 
(—3s to +3s, purple line, n=7 mice) and offset (—3s to +3, pink line, 
n=7 mice). The y axis represents the Spearman correlation coefficient. 
f, Normalized 3-6 Hz power and assembly activation z-score centred on 
freezing onset (left, n =7 mice) and offset (right, n =7 mice). Arrows 
indicate the first time point with a significant increase around freezing 
onset or decrease around freezing offset. Shaded areas and error bars, 
mean + s.e.m.; a.u., arbitrary units. 


power levels and phases (Fig. 3g). We observed that, for different power 
levels, synchronization among AN, but not ON, was restricted to the 
ascending phase of the oscillations and larger than ON synchronization 
(Fig. 3g). These data indicate that the coordination of dmPFC PNs 
into cell assemblies during freezing is orchestrated by 4Hz oscillations. 

To evaluate whether the formation of dmPFC assemblies was causally 
related to freezing expression, we used an optogenetic strategy based on 
the selective activation of dmPFC parvalbumin-expressing interneu- 
rons (PV*) in the ascending or descending phase of 4 Hz oscillations 
(see Methods, Fig. 4 and Extended Data Fig. 5a, b). Conditional 
adeno-associated virus (AAV) encoding for channelrhodopsin 
(ChR2) was injected in the dmPFC of mice expressing the Cre 
recombinase under the control of a PV promoter to specifically infect 
PV* interneurons (Fig. 4 and Extended Data Fig. 5a, b). Because 
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Figure 3 | Four-hertz oscillations organize dmPFC PNs into assemblies 
during freezing. a, Representative example of an LFP recorded in the 
dmPFC showing prominent 4 Hz oscillations and simultaneous sequential 
activation of 14 AN recorded in the same animal during freezing. 

b, Averaged firing of individual AN displayed in a as a function of 4Hz 
LFP oscillation phase. Each AN was significantly modulated by 4 Hz 
phase (Rayleigh test, all P < 0.05). c, Top: average percentage of neurons 
significantly phase-locked to 4 Hz oscillations among all neurons (blue), 
AN (red) and ON (white) (1 =7 mice, one way repeated-measures 
ANOVA, P < 0.001; Bonferroni post-hoc test, ***P < 0.001). Bottom: 
averaged mean resultant length vector (MRL) for phase-locked and non- 
phase-locked AN and ON neurons (n=7 mice, AN: n= 50/90 phase- 
locked neurons; ON: n= 61/200 phase-locked neurons; two-way ANOVA, 
F,: AN versus ON, P= 0.206; Fy: phase-locking, P < 0.001; Fy x Fo: 
P=0.324; Bonferroni post-hoc test, ***P < 0.001). d, Distribution of all 
phase-locked dmPFC PNs assembly participation indices (AP index) as a 
function of preferred phase (left axis, red dots: AN, AP index > 1, n=50 
neurons; white dots: ON, AP index < 1, n=61 neurons; bin: 18°, n =7 
mice). e, Distribution of phase-locked AN and ON preferred phases (AN: 
n=50 neurons; ON: n=61 neurons; bin: 18°, n=7 mice). f, Left: corrected 
pairwise co-firing activity of pairs of AN and ON asa function of 4 Hz 
phase (AN = 1,272 pairs; ON = 2,712 pairs recorded in 7 mice). Right: 
averaged corrected pairwise co-firing for AN and ON neurons during 

the ascending or descending phase of 4 Hz oscillations (n =7 mice, AN: 
n= 1,272 pairs, ON: n 2,712 pairs, two-way ANOVA, F): AN versus ON, 
P=0.003; Fy: ascending versus descending, P < 0.001; F x Fy: P < 0.001; 
Bonferroni post-hoc test, **P < 0.01, ***P < 0.001; one sample t-test 
against hypothetical mean = 0, P < 0.001). h, Left: corrected pairwise 
co-firing activity as a function of the phase and power (z-score) of 4 Hz 
oscillations. Right: delta-corrected pairwise co-firing activity (maximum 
ascending minus minimum descending phase) of pairs of AN and ON 
(AN = 1,272 pairs; ON = 2,712 pairs recorded in 7 mice; two-way ANOVA, 
F\: AN versus ON, P < 0.001; F): power, P < 0.001; F) x Fy: P < 0.001; 
Bonferroni post-hoc AN versus ON, **P < 0.01, ***P < 0.001; one sample 
t-test against hypothetical mean = 0, P < 0.001). Error bars, mean + s.e.m. 


dmPFC assemblies are both phase- and freezing-specific, we recorded 
and performed online detection of freezing, high instantaneous power 
and ascending phase of 4 Hz oscillations in the period after a train of 
CS* presentations (See methods, Extended Data Fig. 6). Each time these 
conditions were fulfilled, we optogenetically activated dmPFC PVt 
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interneurons to inhibit PN firing at the ascending or descending phase 
of 4 Hz oscillations (Fig. 4a, d and Extended Data Figs 5-7). During 
post-CS* presentations, green fluorescent protein (GFP) controls exhib- 
ited high fear levels on day 2 and low fear levels on days 3 and 4, which 
represent a natural timing effect, not a bias of our experimental condi- 
tions (Extended Data Fig. 6c). Importantly, activating or inhibiting PV* 
interneurons in the ascending or descending phase of 4Hz oscillations 
did not modify 4 Hz power compared with periods without stimulation 
or GFP controls (Extended Data Fig. 5d-i). In contrast, the activation or 
inhibition of PV~ interneurons in the ascending or descending phase 
of 4 Hz oscillations respectively reduced and increased the co-firing 
activity of dmPFC PNs in the phase targeted (Extended Data Fig. 8c, d). 
Optogenetic-mediated inhibition of dmPFC PNs in the ascending 
phase of the oscillation significantly reduced freezing compared 
with control conditions (Fig. 4b, c, g, h and Extended Data Fig. 6). To 
evaluate the temporal specificity of our manipulation, we inhibited the 
firing of dmPFC PNs in the descending phase of the oscillation during 
freezing, when dmPFC ANsare less likely to be coactive (Fig. 3d-f). 
This manipulation significantly increased freezing compared with 
control conditions (Fig. 4e-h and Extended Data Fig. 6). Importantly, 
because we performed these optogenetic manipulations over 2 days 
(Extended Data Fig. 6d), we ensured that the stimulation in the 
ascending phase of 4 Hz oscillations on the first day had no influ- 
ence on CSt-evoked freezing the next day (Extended Data Fig. 6g). 
Finally, we repeated our stimulation experiment in mice in which PV* 
interneurons expressed archeorhodopsin (ArchT) to inhibit dmPFC 
PV interneurons and thereby disinhibit dmPFC PNs (Extended 
Data Fig. 9). When applied in the descending phase of the oscillation, 
stimulation had no effect on freezing (Extended Data Fig. 9b, c). 
However, in the ascending phase of the oscillation, PN disinhibition 
produced a significant increase in freezing (Extended Data Fig. 9d, e). 
Together, these results demonstrate that the formation of dmPFC 
assemblies at the ascending phase of 4 Hz oscillations and their 
suppression in the descending phase are necessary for the precise 
temporal coding of fear behaviour. 

Using single-unit and LFP recordings and optogenetic manipulations, 
we demonstrated that a subset of dmPFC PNs participate in the for- 
mation of functional assemblies, dynamically coordinated by 4 Hz 
oscillations during freezing. Our results indicate that the formation 
of dmPFC assemblies at specific phases of 4 Hz oscillations is causally 
related to freezing expression. Importantly, although the strength of 
phase locking or the increase in 4 Hz power might impact neuronal 
synchronization among ANs, our analyses indicate that they cannot 
explain alone the specific and stronger synchronization we observed 
between AN (Fig. 3f, g). Therefore the temporal organization of ANs in 
the ascending phase of 4 Hz oscillations appears as a specific encoding 
mechanism for freezing. Modification of dmPFC spiking activity 
after presentations of threat-predicting cues, a form of rate coding 
mechanism that correlates with fear responses, has been described 
in the past!*!7!82! and could be related to plasticity mechanisms 
or attentional processes. Our data indicate that, in addition to this 
mechanism, the expression of conditioned freezing is also encoded in 
the dynamic organization of internally generated dmPFC assemblies. 
To our knowledge, these data represent the first causal demonstra- 
tion of a phase-specific coding mechanism during conditioned fear 
behaviour. In a recent study, we imposed a 4 Hz analogue stimulation 
to the dmPFC through PV* interneurons and showed that this artificial 
rhythm was sufficient to elicit freezing and fear memory”°. The concept 
in the present model is radically different in that we did not impose the 
rhythm but intended to interfere with dmPFC assemblies nested in the 
ascending phase of 4 Hz oscillations, which were specifically activated 
during fear expression (see Supplementary Information for further 
discussion). Whereas 4 Hz oscillations have already been described 
in different structures during various behavioural tasks2%2?-25, 
the origin of this slow oscillation is still unknown. It was suggested 
that this oscillation originates from the ventral tegmental area‘, or 
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Figure 4 | Phase-specific optogenetic inhibition of dmPFC PNs controls 
freezing. a, d, Schematic of the strategy used to inhibit dmPFC PNs during 
the ascending or descending phase of 4 Hz oscillation. Stim., stimulation. 
b, e, Top: representative example of freezing (black line) and kinetics of 
light pulse stimulations (blue ticks) during and after CS* presentations 
24h (b) and 48h (e) after conditioning. Bottom: corresponding LFP 
spectrogram in the 2-16 Hz band (binary logarithmic scale) during 
light-mediated inhibition of dmPFC assemblies in the ascending (b) or 
descending (e) phase of the oscillation. c, f, Left: time-resolved changes 

in freezing during light-mediated inhibition of dmPFC assemblies in the 
ascending (c) or descending (f) phase of the 4 Hz oscillation in PV-IRES- 


Cre mice infected within the dmPFC with 
CS*, n=7; descending phase: first CS*, n 
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two-way ANOVA, factor 1: ChR2 versus GFP, P < 0.001; factor 2: time, 
P< 0.001; F, x F, P < 0.05; descending phase: two-way ANOVA, factor 1: 
ChR2 versus GFP, P < 0.001; factor 2: time, P < 0.001; F; x Fo, P < 0.05). 
Shaded area represents light stimulation period. Right: averaged freezing 
from PV-IRES-Cre mice infected in dmPFC with ChR2 (n =7) or GFP 
(n=7) during the stimulation (Mann-Whitney test, ChR2 versus GFP, 
**P < 0.01). g, h, Percentage of freezing rescue (freezing for ChR2 minus 
average freezing for GFP) as a function of the phase of the stimulation 
(g), or as a function of both the phase and pulse stimulation accuracy 

(h) for mice stimulated in the ascending or descending phase of the 4 Hz 
oscillation (n = 25 sessions in 16 animals). Pulse stimulation accuracy to 
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METHODS 

Animals. Male C57BL6/J mice (3 months old, Janvier), PV-IRES-Cre mice 
(3 months old, Jackson Laboratory, B6;129P2-Pvalb'™")4" 7/7) and CamKIalpha- 
Cre mice (3 months old, Jackson Laboratory, B6.Cg-Tg(Camk2A-cre)T29-1Stl/J) 
were individually housed for at least 7 days before all experiments, under a 12-h 
light-dark cycle, and provided with food and water ad libitum. All procedures were 
performed in accordance with standard ethical guidelines (European Communities 
Directive 86/60-EEC) and were approved by the committee on Animal Health 
and Care of Institut National de la Santé et de la Recherche Médicale and French 
Ministry of Agriculture and Forestry (agreement A3312001). Representative 
examples and traces displayed in the main figures were observed for the seven 
animals used in these experiments. 

Behaviour. Fear conditioning and testing took place in two different contexts 
(contexts A and B). The conditioning and testing boxes were cleaned with 70% 
ethanol and 1% acetic acid before and after each session, respectively. To score 
freezing behaviour, an automated infrared beam detection system located on 
the bottom of the experimental chambers was used (Coulbourn Instruments). 
Because the detection of our dependent variable (freezing) was independent of the 
experimenter, we did not use a blinding process for group allocation or behaviour 
scoring. The animals were considered to be freezing if no movement was detected 
for 2s. On day 1, C57BL6/J mice were submitted to a habituation session in context 
A, in which they received four presentations of the CS* and the CS~ (total CS 
duration 30s, consisting of 50-ms pips at 0.9 Hz repeated 27 times, 2 ms rise and fall; 
pip frequency 7.5 kHz or white-noise, 80 dB sound pressure level). Discriminative 
fear conditioning was performed on the same day by pairing the CS* with a US 
(1-s foot-shock, 0.6mA, 5 CS*t-US pairings; inter-trial intervals, 20-180s). The 
onset of the US coincided with the offset of the CS*. The CS” was presented after 
each CSt-US association but was never reinforced (five CS presentations; inter- 
trial intervals, 20-180s). The frequencies used for CSt and CS” were counterbal- 
anced across animals and randomization of CS~ and CS? allocation was performed 
using an online randomization algorithm (http://www.randomization.com/). On 
day 2, conditioned mice were submitted to a post-fear-conditioning session in 
context B during which they received 4 and 12 presentations of the CS~ and CST, 
respectively. Seven naive C57BL6/J mice were included in this experiment and the 
data were collected in two distinct replicates. For optogenetic experiments using 
channelrhodopsin (ChR2), archeorhodopsin (ArchT) or GFP controls, PV-IRES- 
Cre mice were submitted to the same conditioning protocol described above except 
that on days 2, 3 and 4, conditioned mice were submitted to post-fear-conditioning 
sessions in context B during which they received four presentations of the CS~ 
followed by four presentations of the CS*. The number of CS* was limited to four 
to prevent fear memory extinction from day 2 to days 3 and 4. Blue (ChR2) or 
Yellow (ArchT) light stimulation was applied during 30s after the last pip of the 
four CS*. The rationale behind this stems from our previous study’? showing that 
manipulation of CS*-evoked dmPFC activity impacted fear expression. To avoid 
interference with this activity period, the present manipulations were restricted 
to those 30s immediately after sound presentation. Importantly, freezing levels 
on day 1 were of similar magnitude in GFP control and wild-type animals used in 
the first part of this study (Extended Data Fig. 6a, b). Typically, freezing levels in 
GEP controls during CS* were not different from day 2 to days 3 and 4 (Extended 
Data Fig. 6c). However, freezing level observed in the 30s time windows after CST 
went from high on day 2 to moderate on days 3 and 4 (Extended Data Fig. 6c). 
Our hypothesis predicts that 4 Hz ascending phase inhibition of PNs will decrease 
freezing and conversely for a stimulation in the descending phase. To evaluate the 
changes in freezing levels upon optogenetic stimulation, this behavioural pattern 
brought us to test ascending phase stimulation on day 2 and descending phase on 
day 3 to avoid floor and ceiling effect respectively (Extended Data Fig. 6d). For 
the same reasons, 4 Hz descending and ascending phase disinhibition of PNs were 
performed on days 3 and 4, respectively (Extended Data Fig. 9a). We also noticed a 
tendency for optogenetic stimulation to decrease freezing level around subsequent 
CS* on the same day in the ChR2 group (Extended Data Fig. 6e, f). To rule out 
the possibility that changes in freezing behaviour upon optogenetic stimulation 
were not solely due to the direct action of the stimulation but also to a cumula- 
tive effect of past manipulations, our analyses were restricted to the 30s epoch 
after the first CS* on days 2, 3 and 4. Although this decrease was not observed 
with ArchT animals (Extended Data Fig. 9b, d) we used the same restriction with 
this group for the sake of consistency with ChR2 group. Note that within these 
30s, light application was conditioned to the behaviour and the brain state of the 
animal (see section on ‘Closed-loop stimulatiom). To score freezing behaviour, 
the aforementioned automated infrared beam detection was used in addition to 
a Cineplex video tracking system (Plexon) that was used for online detection of 
freezing and subsequent conditional optogenetic stimulation. For optogenetic 
experiments, four distinct behavioural experiments were performed to collect 
the entire data set. The impact of stimulation pulse phase specificity on freezing 
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behaviour was assessed by calculating freezing rescue as a function of pulses 
preferred 4 Hz phase as well as phase locking strength. Freezing rescue was 
calculated as the difference between the average freezing levels of GFP control 
and that of ChR2 animal during optical stimulation epochs. Stimulation pulses 
preferred phase was retrieved through the pulse phase histogram and tested for 
significance (Rayleigh’s test P). Pulse accuracy was quantified by phase locking 
strength using the mean resultant length (MRL): that is, the length of the mean 
vector. Low or high MRL values are respectively indicative of a spread or a con- 
centrated circular distribution of pulses around the preferred phase. In a first 
set of analyses we considered only those experiments with canonic features 
(Fig. 4a-g): that is, where pulse phase distribution was significant (P < 0.05), 
preferred phase was in the intended range (ascending phase, — 180 to 0°; descend- 
ing phase, 0-180°) and MRL was above 0.2. For ArchT experiments, five animals 
fulfilled the criteria on days 3 and 4. For ChR2 experiments, on day 2 (ascending 
phase) seven animals fulfilled the criteria; on day 3 (descending phase) seven ani- 
mals fulfilled the criteria. In a second set of experiment (Fig. 4h) we addressed the 
joint impact of phase and pulse accuracy on freezing level. For that purpose, we 
added sessions with weak MRL values to the previous set of analysed data. On day 2 
(ascending phase) 14 sessions fulfilled the criteria; on day 3 (descending phase) 
11 sessions fulfilled the criteria. That is a total of 25 sessions that were acquired in 
16 animals as follows: 9 animals on both days 2 and 3; 5 animals only on day 2; and 
2 animals only on day 3. The distributions of pulse accuracy (MRL) and preferred 
phase were then two-dimensionally interpolated to create a linearly spaced grid 
of freezing rescue value (bin MRL 0.02; bin phase 36°). 

Surgery and recordings. Mice were anaesthetized with isoflurane (induction 3%, 
maintenance 1.5%) in oxygen. Body temperature was maintained at 37°C with a 
temperature controller system (FHC). Mice were secured in a stereotaxic frame 
and unilaterally implanted in the left dorsomedial prefrontal cortex (dmPFC) with 
a multi-wire electrode array aimed at the following coordinates: 2mm anterior 
to bregma; 0.3 mm lateral to the midline; and 0.8-1.4mm ventral to the cortical 
surface. The electrodes consisted of 16 or 32 individually insulated nichrome 
wires (131m diameter, impedance 30-100kQ; Kanthal) contained in a 26-gauge 
stainless-steel guide cannula. The wires were attached to an 18-pin connector 
(Omnetics) or 2 connectors in the case of a 32-wire assembly. All implants were 
secured using Super-Bond cement (Sun Medical). After surgery mice were allowed 
to recover for 7 days and were habituated to handling. Analgesia was applied before, 
and 1 day after surgery (Metacam, Boehringer). Electrodes were connected to 
one or two headstages (Plexon) containing 16 unity-gain operational amplifiers. 
Each headstage was connected to a 16-channel preamplifier (gain 100 x bandpass 
filter from 150 Hz to 9kHz for unit activity and from 0.7 Hz to 170 Hz for field 
potentials, Plexon). Spiking activity was digitized at 40 kHz and bandpass filtered 
from 250 Hz to 8kHz, and isolated by time-amplitude window discrimination and 
template matching using a Multichannel Acquisition Processor system (Plexon). 
At the conclusion of the experiment, recording sites were marked with electro- 
lytic lesions before perfusion, and electrode tips locations were reconstructed with 
standard histological techniques. 

Virus injections and optogenetics. For optical control of PV interneurons, 
conditional AAV encoding ChR2 (AAV-EF;a-DIO-hChR2(H134R)-EYFP, 
serotype 5, Vector Core, University of North Carolina) or ArchT (AAV-FLEX- 
ArchT-GFP, serotype 9, Vector Core, University of North Carolina) were bilat- 
erally injected into the dmPFC of PV-IRES-Cre mice from glass pipettes (tip 
diameter 10-20 1m) connected to a picospritzer (Parker Hannifin Corporation; 
approximately 0.4\1L per hemisphere) at the following coordinates: 2mm anterior 
to bregma; 0.4mm lateral to midline and 0.9 to 1.2 mm ventral to the cortical 
surface. One to two weeks after the injection mice were implanted bilaterally with 
optic fibres (diameter: 200|1m; numerical aperture: 0.37; flat tip; Doric Lenses) 
at the same coordinates. Control experiments were performed using an AAV 
containing the DNA construct for only GFP (AAV-FLEX-GFP, Vector Core, 
University of North Carolina). All implants were secured using Super-Bond 
cement (Sun Medical). For experiments using optogenetic stimulation coupled 
to single-unit recordings, one of the two optic fibres was combined to the array 
of 16 or 32 individually insulated nichrome wires. Single-unit recordings during 
the manipulation of PV interneurons were performed as described in the ‘Surgery 
and recordings’ section. Behavioural and recording experiments were performed 
3-5 weeks after injection. The light (approximately 14mW per implanted fibre) 
was bilaterally conducted from the laser (OptoDuet 473/593 nm, Ikecool) to the 
mice via two fibre-optic patch cords (diameter 200 1m, Doric Lenses), connected 
to a rotary joint (1 x 2 fibre-optic rotary joint, Doric Lenses) that allowed mice to 
freely move in the behavioural apparatus. Instead of directly manipulating dmPFC 
PNs, we capitalized on the properties of cortical PV* interneurons, which are a 
particular class of GABAergic interneurons regulating efficiently the output activity 
of cortical principal excitatory neurons**"!. Because a single PV* interneuron 
can contact more than 1,500 PNs*”, the optogenetic activation or inhibition of 
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population of PV* represents an efficient tool for controlling PN activity 
This strategy was motivated by the fact that optogenetic manipulation of PNs was 
associated with large artefacts, which prevented the simultaneous recording of 
LFPs (Extended Data Fig. 5c). For optogenetic manipulation of PV™ interneurons 
during behaviour, we used 30 ms light pulses delivered under specific behavioural 
and neurophysiological conditions (see section on ‘Closed-loop stimulation’). 
After behavioural and recording experiments, mice were perfused and histological 
analysis was performed. 

Anatomical and histological analysis. Mice were euthanized with isoflurane and 
perfused through the left ventricle with 4% w/v paraformaldehyde in 0.1 M PBS. 
Brains were dissected out and postfixed for 4h at 4°C in the same solution. Sections 
(601m thick) were cut, mounted on gelatin-coated microscope slides and dried. 
Sections were stained with toluidine blue, dehydrated and mounted. Electrolytic 
lesions were identified with conventional transmission light microscopy. Only 
recordings with confirmed lesions in cingulate and prelimbic areas of dmPFC 
were included in our analyses. It is important to note that because the electrode 
bundles are manually fabricated in the laboratory and individual wires can be 
slightly displaced during insertion in the brain, the distance between each electrode 
cannot be measured precisely. It is therefore technically impossible to determine 
from which layer the single unit and LFP recorded from the same electrode were 
performed. For verification of viral injections in dmPFC (which include the 
cingulate cortex ACC and the prelimbic area), serial 80-|:m-thick slices containing 
the dmPFC were imaged using an epifluorescence system (Leica DM 5000) fitted 
with a 10x dry objective. The location and the extent of the injections/infections 
were visually controlled. Only infections accurately targeted to dmPFC were 
considered for behavioural and electrophysiological analyses. 

Single-unit analyses. Single-unit spike sorting was performed using Off-Line 
Sorter software (Plexon) for all behavioural sessions. Principal component scores 
were calculated for unsorted waveforms and plotted in a three-dimensional 
space defined by principal components and/or timing and voltage features of the 
waveforms; clusters containing similar valid waveforms were manually defined. 
A group of waveforms were considered to be generated from a single neuron 
if the waveforms formed a discrete, isolated, cluster in the three-dimensional 
space and did not contain a refractory period less than 1 ms, as assessed using 
inter-spike interval analysis. To avoid analysis of the same neuron recorded on 
different channels, we computed cross-correlation histograms. If a target neuron 
presented a peak of activity at a time that the reference neuron fired, only one of 
the two neurons was considered for further analysis. To separate putative inhibi- 
tory interneurons from putative excitatory PNs we used an unsupervised cluster 
algorithm based on Ward’s method. In brief, the Euclidian distance was calculated 
between all neuron pairs on the basis of the three-dimensional space defined by 
each neurons average half-spike width (measured from trough to peak), the firing 
rate and the area under the late positive deflection phase of the spike. An iterative 
agglomerative procedure was then used to combine neurons into groups based 
on the matrix of distances such that the total number of groups was reduced to 
return the smallest possible increase in within-group sum of square deviation. 
Cross-correlation analysis was then performed to evaluate the excitatory or inhib- 
itory nature of each neuron on other simultaneously recorder units. To assess the 
significance of cross-correlogram analyses performed between pairs of recorded 
neurons, a mean firing rate with 95% confidence limits of the target neuron firing 
rate was calculated. Significant short-latency inhibitory or excitatory interactions 
were retained if the number of action potentials of the target neuron was infe- 
rior or superior to these 95% confidence limits, respectively. Note that among 
the clusters of PNs and interneurons, no PNs were found to inhibit another cell 
and no interneurons were found to excite another cell. To identify the main firing 
patterns among PNs, we used an unbiased principal component analysis based on 
the neuronal activity evoked by CS* presentations (z-score 500 ms before and after 
CS* presentations, CS* presentations one to four in the post-fear-conditioning 
sessions, each CS* consisting of 27 individual sound pips; bin size of 10 ms). Only 
the first principal component was considered (PC1) because it explained most of 
the variance of our data set. One neuron was excluded from analysis on the motive 
of an absence of spiking activity around the time of CS* presentation. Neurons 
(n= 289) were classified as correlated, indifferent of inversely correlated with PC1 
at the P < 0.05 significance level. 

Neuronal assembly isolation. We hypothesized that single neurons of the dmPFC 
functionally aggregate into discrete assemblies to code for fear expression. We 
used a previously published method* to evaluate the possibility that a subset 
of PNs is coactivated during discrete time windows inside freezing epochs. We 
calculated single-unit rate histograms with a sliding window of 150 ms and 100 ms 
overlap, to minimize potential slicing of assembly realizations. In this study we were 
interested in identifying which neurons were coactivated rather than how much 
they were coactivated. We therefore focused on the sole presence or absence of 
activity for a neuron inside each time bin independently of its actual firing rate. 
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To this end, the rate histograms were binarized in such a way that any bin value 
strictly above zero was given a value of 1 (otherwise 0, Extended Data Fig. 3a, top). 
Binarized histograms for n simultaneously recorded PNs were then concatenated 
and we calculated the n x n coactivation matrix of the PN population for each 
time bin (Extended Data Fig. 3a, bottom). To investigate the emergence of specific 
coactivation pattern during the recording session, principal component analysis 
was performed on the coactivation matrix. We considered only the score on the 
first principal component (PC1) that explains the greatest part of the coactivation 
matrix variance. To extract putative coactivation matrix pattern related to fear 
expression, we then analysed the distribution of PC1 scores as a function of animal 
freezing behaviour. Note that we also investigated the impact of sliding window 
length on PCI score distributions inside and outside freezing epochs to define the 
optimal window length to isolate freezing-related neuronal assemblies. The optimal 
window length was found to be 150 ms, as shorter windows were not as efficient in 
discriminating freezing versus no-freezing epochs and larger windows were not 
further ameliorating that discrimination (Extended Data Fig. 3c). The probability 
of a certain PC1 score predicting freezing was analysed using a bootstrap method as 
follows. PC1 score was binned (0.1 units) and we calculated the freezing probability 
associated with each score bin. The same procedure was applied to a set of surro- 
gate data for which freezing intervals were shuffled 50 times. Actual and surrogate 
freezing probabilities as a function of PC1 score were then compared to retrieve 
the threshold above which PC1 score predicted freezing behaviour above chance 
level (red dotted line in Extended Data Fig. 3b, d~e, h-m). This threshold was 
then used to isolate coactivation matrix patterns predicting freezing behaviour 
(hereafter named coactivation matrix freezing patterns). Finally, we identified and 
analysed which single neurons were coactive inside coactivation matrix freezing 
patterns associated with freezing behaviour (Extended Data Fig. 3f, top). Note that 
the weight of a neuron within the coactivation matrix freezing pattern can be solely 
due to its random probability of spiking (for example, neurons that increase firing 
during freezing would be more likely to co-fire). To correct for this potential bias, 
we tested the significance of each single neuron contribution to the coactivation 
matrix freezing pattern against that of a surrogate data set (50 time shuffling of 
the spike train inter-spike intervals, Extended Data Fig. 3f, bottom). The aver- 
age coactivation matrix was normalized by dividing it by the average surrogate 
coactivation matrix (Fig. le, all other examples see Extended Data Fig. 3h-m). Each 
neuron was then given an assembly participation index (AP index) defined as the 
ratio between its average and surrogate firing rates for coactivation matrix freezing 
patterns (Fig. 1d, e). Each neuron displaying an AP index significantly above the 
chance level (chance: AP index = 1) defined by the confidence interval of surrogate 
data was considered as an AN (AP index > 1). Units with an AP index inferior or 
equal to chance level were classified as ONs (ON, AP index < 1). 

LFP analyses. LFPs were analysed using custom scripts from Matlab, as well as 
an open source Matlab toolbox for the analysis of circular statistics*”. LFP signals 
were filtered in the 4 Hz range (3-6 Hz) using a second-order Butterworth filter. To 
evaluate 4 Hz phase locking of individual neurons, we calculated the instantaneous 
phase of the 3-6 Hz filtered LFPs using the Hilbert transform. For a given neuron, 
each spike was assigned its corresponding LFP 4 Hz phase value from the LFP 
signal recorded on the same wire. Phase locking was calculated using Rayleigh’s test 
for circular uniformity and statistical significance was assessed using Rayleigh’s test 
Pand z values. For significantly phased-locked PNs, we quantified the preferred 
phase as well as phase locking strength using the MRL. Importantly, MRL estimate 
is highly dependent on the sample size. To account for this bias, we computed MRL 
only for significantly phase-locked neurons displaying at least 50 spikes. Therefore, 
at least the relative comparisons of 4 Hz modulation strength between conditions 
should not be affected by sample sizes (number of spikes). For time-frequency 
analysis, we used wavelet decomposition of LFP signals. LFPs were convolved by a 
family of Morlet’s wavelets, one for each frequency between 2 and 16 Hz (with a log, 
scale) as a function of time. Power spectral density for freezing and non-freezing 
epochs was retrieved by averaging the results of the time-resolved wavelet analysis 
across those specific time intervals. To average LFP power across animals, we first 
normalized the power spectral density histogram as the percentage of total power 
between 2 and 16 Hz. To compare @ (8-12 Hz) and 4 Hz frequencies in freezing 
and non-freezing epochs we calculated the 4 Hz to 6 ratio: 


SD PSDar, — S PSDy 
32 PSDaze + © PSDs 


We addressed the correlation in time between freezing episode onset and offset 
and LFP oscillations in the 2-16 Hz frequency range by calculating the average 
peri-event time spectrograms of dmPFC LFP around the onset or offset of freezing 
epochs. We then addressed the putative relationship between neuronal assembly 
occurrence and the modulation of certain frequency components in the LFP at the 
time of freezing onset and offset. For this we computed a linear correlation between 
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assembly rate (occurrence per second) and LFP power at each frequency between 
2 and 16 Hz. To assess 4 Hz and assembly dynamics inside freezing and no freezing 
episodes, we normalized the length of freezing and no-freezing intervals to express 
time as the percentage of time elapsed inside each interval. 

To evaluate the average pairwise co-firing between pairs of PN as a function of 
4 Hz phase, spike trains were binned according to LFP 4-Hz phase (36° bin size), 
and for each bin class the pairwise co-firing index was calculated as the ratio of 
co-occurring (common) spikes to the total number of spikes in the bin class for 
the two units. This provides a simple yet direct measure of the co-occurrence of 
unit spikes as a function of phase. To ensure the pairwise co-firing of AN neurons 
was not artificially enhanced by (1) the temporal relation between pairs of AN 
neurons or (2) the cycle-to-cycle relation between AN neurons (phase relation), 
we computed a shuffled co-firing (1) and phase-shift predictor (2) as follows. For 
the shuffled co-firing analysis, for each pair of PNs, one spike train inter-spike 
intervals were time-shuffled 50 times and the mean surrogate co-firing for each 
phase bin value recalculated. We then averaged over all AN or ON neuron pairs. 
This analysis conserves the overall firing rate of the shuffled spike train but destroys 
the temporal relationship between the two spike trains. Since ANs and ONs present 
different firing rates and different modulations of firing rates in relation to freezing 
(Extended Data Fig. 4a), this correction allows the comparison of co-firing between 
both populations. For the shift predictor analysis, for each 36° phase bin and each 
pair of AN neurons, 4 Hz cycles were given an index from 1 to n, n corresponding 
to the total number of 4 Hz cycles in the recording. Indices were then shuffled 
50 times for the reference neuron and the mean surrogate co-firing was calculated. 
This analysis destroys the cycle-to-cycle relation between the two spike trains 
while preserving the firing modulation by behaviour. We then calculated the 
corrected pairwise co-firing by subtracting both the shuffled co-firing and the shift 
predictor to the actual co-firing and subsequently evaluated the sole contribution of 
freezing or phase to the co-firing of PN. In addition we assessed the impact of the 
instantaneous power of 4 Hz oscillations on PN co-firing. To this end we calculated 
pairwise co-firing as a function of phase for different classes of 4 Hz power. Power 
during freezing was z-scored and binned (0.49 s.d.) and pairwise co-firing was 
calculated as a function of phase and power. The analysis was restricted to the 
—1.96 s.d. to +1.96 s.d. power range (eight bins) encompassing 95% of power 
values to exclude extreme values with low occurrence. To compare ANs and ONs 
on the way their phase-specific pairwise co-firing is impacted by 4 Hz power 
change, we calculated the difference in corrected co-firing between ascending and 
descending phases as follows: ACF = max(CF,scending)/Min(CFgescending)- 
Closed-loop stimulation. Neurons forming a functional assembly cannot be 
specifically manipulated using a standard optogenetic approach. Although such 
cellular specificity is unattainable to this day, a certain temporal specificity exists 
as the activation of fear coding neuronal assemblies is specific of 4 Hz ascending 
phase. To apply our stimulation in a functionally specific manner, we performed 
optogenetic inhibition of PNs as a function of the phase (ascending or descending) 
of the ongoing 4 Hz oscillation in dmPFC LFP. To that end we designed a closed- 
loop stimulation protocol where animal behaviour (freezing) and dmPFC LFP 
(4 Hz power and phase) were monitored online and simultaneously analysed to 
drive a laser beam. Animal position was sampled at 80 Hz and LFP at 1 KHz, then 
uploaded every 10 ms in Matlab for online analysis. The former served, on the 
one hand, to quantify animal speed in the last 1,000 ms of uploaded signal and 
on the other hand to bandpass filter the LFP signal (3-6 Hz range) and retrieve 
both 4 Hz power in the last 500 ms and 4 Hz phase for the last recorded data point. 
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Animal speed was calculated as the average distance travelled over the last 1,000 ms 
time window and compared with the freezing threshold. Bandpass filtering was 
applied using a second-order Butterworth filter and Hilbert transform was used to 
estimate both 4 Hz power in the last 500 ms and LFP instantaneous phase for the 
last data point. Three conditions were defined for the stimulation. For the most 
recent 10 ms upload if (1) animal speed was below freezing threshold, (2) 4Hz 
power was above that of baseline level and (3) 4Hz phase was encompassed within 
the range of choice (ascending, 0-180°; or descending, 180-360°), then stimulation 
was triggered. The stimulation consisted of a 1 ms pulse sent from Matlab to a pulse 
generator (Master 9, AMPI) that in turn sent a 30 ms pulse to a blue/yellow laser 
generator to deliver optical stimulation to the dmPFC. The whole computation 
from neuronal data read to laser onset was achieved in 30 ms maximum (data 
retrieval 10 ms; computation 5-20 ms). Hence, at the end of the 30 ms light pulse, 
anew analysis loop was completed and if the three conditions were again fulfilled 
in the most recent 10 ms upload then a new stimulation pulse was triggered. 
Statistical analyses. For each statistical analysis provided in the manuscript, the 
Kolmogorov-Smirnov normality test was first performed on the data to determine 
whether parametric or non-parametric tests were required. Two different 
approaches were used to calculate the sample size. For studies in which we had 
sufficient information on response variables, power analyses were performed to 
determine the number of mice needed. For studies in which the behavioural effect 
of the manipulation could not be pre-specified, such as optogenetic experiments, 
we used a sequential stopping rule. In essence this method enables null-hypothesis 
tests to be used in sequential stages, by analysing the data at several experimental 
points using t-tests. Usually the experiment started by testing only a few animals 
and if the P value was below 0.05, the investigator declared the effect significant 
and stopped testing. If the P value was greater than 0.36, the investigator stopped 
the experiment and retained the null hypothesis. For sample size estimation using 
power analyses, we used an online power analysis calculator (G*power 3). For each 
analysis, sample size was determined using a power > 0.9 and alpha error = 0.05. 
All tests were two sided. Sample size determination using sequential stopping rule 
analyses were used for optogenetic experiments in which it was not possible to 
determine a priori the effect of the optical manipulation. We used P values of 0.05 
and 0.36 for lower and upper criteria. Using this strategy, we ended up with a value 
of n comprising between five and seven animals per group. No randomization or 
investigator blinding was done. 
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Extended Data Figure 1 | CS+-triggered and spontaneous occurrence 
of conditioned freezing responses. a, Representative examples of 
conditioned freezing behaviour periods recorded 24h after auditory 

fear conditioning (post-FC session). Freezing epochs occurred either 
independently of any sensory stimulation (thick grey lines) or were 
induced by CS* presentations (thick black lines). b, Averaged percentage 
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of freezing behaviour recorded 24h after auditory fear conditioning 
(post-FC session) within or outside CS* presentations (n=7 mice, paired 
t-test, within versus outside CS*, **P < 0.01). ¢, Averaged percentage 

of freezing episodes initiated outside or within CS* presentations and 
restricted or outlasting CS* presentations (n= 7 mice). Error bars, 

mean = s.e.m. 
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Extended Data Figure 2 | Separation of putative PNs and putative 
interneurons. a, Location and depth of recording sites in the dmPFC 
and number of neurons recorded (n =7 mice). b, Top: superimposed 
waveforms recorded from three different units. Bottom: spikes originating 
from individual units were sorted using three-dimensional principal 
component analysis. c, Among the population of neurons recorded, 

86% were classified as putative projection neurons (PN, red circles, 

n= 290) and 14% as putative interneurons (IN, blue circles, n = 48) 
using an unbiased unsupervised cluster separation algorithm based on 
three electrophysiological properties: firing frequency, spike half width 
(SHW) and spike area under waveform peak. Inset, average waveform of 
a representative PN and interneuron illustrating the methodology used 
to quantify SHW and the spike segment used to calculate the spike area 
under waveform. d, Top: representative cross-correlogram performed 


between a PN and interneuron showing a short-latency, presumably 
monosynaptic, excitatory interaction (no inhibitory interaction 
identified among all significant cross-correlograms identified). Bottom: 
representative cross-correlogram between an interneuron and a PN 
showing a short-latency, possibly monosynaptic, inhibitory interaction 
(no excitatory interaction identified among all significant cross- 
correlograms identified). Reference events correspond to the spikes of the 
presynaptic neuron (dashed line at time 0, bins of 0.5 ms). Red and blue 
circles represent PNs and interneurons, respectively. e, Box plot of the 
firing rate of PNs (n = 290 in 7 mice) and interneurons (n = 48 in 7 mice). 
For each box plot, the middle, bottom and top lines correspond to the 
median, lower quartile and upper quartile firing rates. Blue and red dots 
below and above the boxes correspond to outlier data points. 

f, Corresponding firing frequency statistics for PNs and interneurons. 
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Extended Data Figure 3 | See next page for caption. 
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Extended Data Figure 3 | Isolation of dmPFC neuronal assemblies. 

a, Top: example of a binarized rate histogram matrix computed from 

55 PNs recorded in a single animal. Bottom: examples of correlation 
matrices computed for particular time bins from the same 55 PNs (bin 
size: 150 ms). b, Corresponding example of principal component scores 
for the same population of 55 PNs and freezing behaviour as a function 
of time. For the displayed component (first principal component), the 
distribution of scores is different for freezing and no freezing periods. 
This shows that freezing episodes are associated with a specific profile of 
activity for the simultaneously recorded neurons taken as a population. 
c, Averaged Mahalanobis distance between neuronal population profiles 
correlated with freezing and no freezing epochs as a function of the length 
of the sliding window used to produce the binarized rate histograms 
(see Methods). The optimal separation for neuronal population profiles 
correlated with freezing and no freezing epochs was obtained with 
sliding window lengths between 150 and 250 ms (n =7 mice, Friedman 
repeated-measures one-way ANOVA on ranks test, *P < 0.05). Error 
bars, mean + s.e.m. d, Corresponding example distribution of principal 
component scores for neuronal population profiles correlated with 
freezing (black dots) and no freezing (white dots) epochs (Wilcoxon 
signed rank test, P < 0.05). e, Corresponding example of freezing 
probability as a function of neuronal population principal component 
score. Freezing probability associated with each score is compared with 
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surrogate data (black dots) for which freezing intervals have been shuffled. 
Plain and open red dots represent neuronal population profiles that are 
significantly over-represented compared with chance during freezing and 
no freezing periods, respectively (one sample t-test, surrogate data versus 
actual freezing probability for each score, for scores equal to or above 

1.15, freezing probability superior to chance level and P < 0.05). The red 
dotted line represents the boundary between neuronal population profiles 
correlated with freezing and no freezing epochs. f, Top: coactivation 
matrix averaged over freezing epochs from the same 55 dmPFC PNs 
recorded in a single animal. The strength of the coactivation between 
pairs of neurons is expressed as the percentage of coactivation compared 
with chance. Bottom: surrogate data. Coactivation matrix averaged over 
freezing epochs from the same 55 dmPFC PNs recorded in a single animal 
for which spike trains have been shuffled 50 times. g, Average percentage 
of ANs (AP index > 1) and ONs (AP index < 1) to freezing-activated 
neuronal assemblies (n = 7 mice, ON: 59.6+4.5%; AN: 40.4+ 4.5%). 
h-m, Left: example of freezing probability as a function of neuronal 
population principal component score for individual mice as in e. Right: 
normalized coactivation matrix averaged over freezing predicting epochs 
from dmPFC PNs recorded in individual animals (n = 29, 40, 46, 26, 45 
and 49 neurons). The strength of the coactivation between pairs of neuron 
is expressed as the percentage of coactivation compared with chance. Error 
bars, mean + s.e.m. 
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Newman-Keuls post-hoc test, *P < 0.05). ¢, Z-score-normalized ANOVA on ranks, P > 0.05). Error bars, mean +s.e.m. 
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Extended Data Figure 5 | Histological controls and closed-loop 
stimulation technique. a, Representative micrograph used for dmPFC 
GFP expression assessment. b, Representative dmPFC PV interneurons 
with distinct dendritic arborizations were infected. Left: multipolar 
neuron with a round soma, corresponding to a putative basket cell. 

Right: interneuron with an ovoid soma at the layer 1-2 border, displaying 
asymmetric and tufted dendrites. Some branches extend towards the 

pia, bend and follow the pial surface. This neuron corresponds to the 
typical description of axo-axonic (chandelier) cells in mPFC. ¢, Top: 
representative raw (black trace) and filtered (grey trace, 3-6 Hz) dmPFC 
LFP recorded in PV-IRES-Cre mice during 4 Hz oscillations (top) or in 
CAMKI-Cre mice outside 4 Hz oscillations infected with ChR2 (bottom) 
upon light activation (30 ms square pulses; black line, pulse start; red bar, 
pulse length). d, e, Averaged LFP power spectra obtained in ChR2-infected 
mice and GFP controls during stimulated (coloured traces) and non- 
stimulated (grey traces) periods when the stimulation was applied in the 


ascending (ChR2, n=7 mice; GFP, n=7 mice) or descending (ChR2, n=7 
mice; GFP, n=7 mice) phase. f, Averaged 3-6 Hz LFP power for ChR2 
and GFP mice during stimulation in the ascending or descending phase 
of 4 Hz oscillations expressed as a percentage of no stimulation periods 
(ChR2, n=7 mice; GFP, n=7 mice; two-way ANOVA, factor 1: GFP 
versus ChR2, P = 0.848; factor 2: descending versus ascending, P = 0.879; 
F, x F,, P=0.618). g, h, Averaged LFP power spectra obtained in ArchT 
infected mice and GFP controls during stimulated (coloured traces) and 
non-stimulated (grey traces) periods when the stimulation was applied in 
the descending (ArchT, n=5 mice; GFP, n=7 mice) or ascending phase 
(ChR2, n=5 mice; GFP, n= 4 mice). i, Averaged 3-6 Hz LFP power for 
ArchT and GFP mice during stimulation in the descending or ascending 
phase of 4 Hz oscillations expressed as a percentage of no stimulation 
periods (ArchT, n=5 mice; GFP, n= 7/4 mice; two-way ANOVA, factor 1: 
GFP versus ChR2, P = 0.872; factor 2: descending versus ascending, 
P=0.183; F, x F,, P=0.5). Error bars, mean +s.e.m. 
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Extended Data Figure 6 | Phase-specific optogenetic inhibition of 
dmPFC PNs. a, Twenty-four hours after conditioning, levels of freezing 
in wild type (n =7) and GFP (n= 7) control animals where similar (2 min 
baseline, four CS* and four post-CS* periods of 30s averaged, two-way 


ANOVA, factor 1: wild type versus GFP, P > 0.05; factor 2: time, P < 0.001; 


F, x Fy, P < 0.01, Student-Newman-Keuls post-hoc test within wild 
type and GFP, ***P < 0.001). b, Corresponding time-resolved changes 
in freezing behaviour around CS* offset for wild type and GFP control 
animals (two-way ANOVA, factor 1: wild type versus GFP, P > 0.05; 
factor 2: time, P > 0.05; F, x Fy, P > 0.05). c, Comparison of GFP 
control levels of freezing during (CS*) and 30s after CS* (post-CS*) 

on days 2 (n=7), 3 (n=7) and 4 (n= 4) (2 min baseline, four CS* and 
four post-CS* periods of 30s averaged; two-way ANOVA, factor 1: day, 
P > 0.05; factor 2: time, P < 0.001; F; x F2, P > 0.05; Bonferroni 
post-hoc test between epochs, **P < 0.01, ***P < 0.001). d, Timeline 
scheme for the optogenetic stimulation protocol on day 2 with stimulation 
targeting the ascending phase of 4 Hz oscillation cycle (top) and day 3 
with stimulation targeting the descending phase of 4 Hz oscillation cycle 
(bottom). e, f, Comparison of freezing levels between CS* 1 and CS* 2-4 


for epochs 10s preceding, during and 30s after CS* during optogenetic 
stimulation (pre-CS*, CS* and post-CS* respectively) on day 2 (e) and 
day 3 (f). On day 2, ascending phase stimulation after CS* 1 induced a 
decrease of freezing levels during subsequent CS* epochs as no more 
difference was observed between pre CS*, CS* and post CS* for 

CS* 2-4 compared with CS* 1 (n=7 mice, repeated-measures 

two-way ANOVA, factor 1: CS* 1 versus CSt 2-4, P > 0.05; factor 2: 
epoch, P < 0.05; F; x Fo, P > 0.05; Student-Newman-Keuls post-hoc test 
within factor 1, *P < 0.05 compared with both pre- and post-CS* epochs). 
On day 3, descending phase stimulation after CST 1 induced an increase in 
freezing levels during pre-CS* epochs as no more difference was observed 
between pre-CS*, CS* and post-CS* for CS* 2-4 compared with CS* 1 
(n=7 mice, repeated-measures two-way ANOVA, factor 1: CS* 1 versus 
CSt 2-4, P > 0.05; factor 2: epoch, P < 0.05; F, x Fy, P < 0.05; Student- 
Newman-Keuls post-hoc test within factor 1, *P < 0.001 compared with 
both CS* and post-CS* epochs). g, Averaged freezing behaviour evoked 
by CS* 1 presentation on day 3 if day 2 stimulation was in the ascending 
phase (GFP n=7, ChR2 with day 2 ascending n = 7, Mann-Whitney rank 
sum test, P=0.259). 
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Extended Data Figure 7 | Phase specificity of the close loop stimulation 
technique. a, Representative distribution of stimulation pulses in relation 
to the phase of the 4 Hz oscillation (black line, filtered LFP 3-6 Hz) when 
the stimulation was applied in the ascending phase of the oscillation (bins 
of 18°). b, Representative cross-correlogram between two dmPFC PNs 
(neurons A and B) participating to neuronal assemblies during (blue bars), 
or outside (grey bars) optogenetic inhibition of PV interneurons. Note the 
reduction in joint firing activity when the stimulation was on. Reference 
events correspond to the spikes of the neuron A (dashed line at time 0, bins 
of 5 ms). c, Firing frequency of dmPFC neurons A (top) and B (bottom) 
without optogenetic stimulation (grey bars, bins of 15 ms) or when the 
pulse was applied in the ascending phase of the 4 Hz oscillation (blue 

bars, bins of 15 ms) (black line, filtered LFP 3-6 Hz). d, Corresponding 
joint peri-stimulus time histograms performed between the same dmPFC 
neurons A and B when the pulse was applied in the ascending phase 

of the 4 Hz oscillation (top) or without stimulation (bottom). Note the 
inhibition of the co-firing activity when the stimulation was on. e, Firing 
probability per degree of identified PV* interneurons as a function of 


4Hz oscillations phase (n = 14 PV* interneurons). f, Cumulative firing 
probability per degree for individual PV~ interneurons (open circles) 
during the ascending or descending phase of 4 Hz oscillations and 
corresponding averaged cumulative firing probability (black bars, n= 14 
PV* interneurons, Wilcoxon signed rank test, P= 0.733). g, i, Top: firing 
of a PV* interneuron in a mouse expressing ChR2 or ArchT in PV* 
interneurons in the dmPFC in response to blue or yellow light (light-pulse 
duration 30 ms; 150 trials). Bottom: mean peri-stimulus time histograms 
of all PV* interneurons displaying significant light-evoked excitation or 
inhibition (n= 10 PV* from five mice infected with ChR2; n =4 PV* 
from two mice infected with ArchT, light-pulse duration, 30 ms). Bins of 

5 ms. h, j, Top: firing of a PN in a mouse expressing ChR2 or ArchT in PV* 
interneurons in the dmPFC in response to blue or yellow light activation 
of PV* interneurons (light-pulse duration, 30 ms; 150 trials). Bottom: 
mean peri-stimulus time histograms of all PNs displaying significant light- 
evoked inhibition or excitation (n = 160 PNs from seven mice infected 
with ChR2; n = 117 PNs from five mice infected with ArchT; light-pulse 
duration, 30 ms). Bins of 5 ms. Error bars, mean + s.e.m. 
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Extended Data Figure 8 | Effect of phase-specific PV manipulation on 
dmPFC co-firing activity. a, b, Top: corrected pairwise co-firing for pairs 
of dmPFC neurons recorded in mice infected with ChR2 or GFP during 
stimulation in the ascending (a, ChR2, n = 2,877 pairs; GFP, n = 2,067 
pairs) or descending (b, ChR2, n = 2,067 pairs; GFP, n = 2,282 pairs) phase 
of 4 Hz oscillations. Bottom: averaged delta-corrected pairwise co-firing 
activity (delta = ChR2 minus GFP) as a function of 4 Hz phase (one sample 
t-test with Bonferroni correction, hypothetical mean = 0, **P < 0.01, 
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***P < 0,001). c, d, Top: corrected pairwise co-firing for pairs of dmPFC 
neurons recorded in mice infected with ArchT or GFP during stimulation 
in the ascending (a, ArchT, n= 2,389 pairs; GFP, n = 2,542 pairs) or 
descending (b, ArchT, n = 3,122 pairs; GFP, n = 2,282 pairs) phase of 4 Hz 
oscillations. Bottom: averaged delta-corrected pairwise co-firing activity 
(delta = ArchT minus GFP) as a function of 4 Hz phase (one-sample t-test 
with Bonferroni correction, hypothetical mean = 0, *P < 0.05, **P < 0.01, 
*** DP < 0,001). Error bars, mean +s.e.m. 
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Extended Data Figure 9 | Phase-specific optogenetic disinhibition 

of dmPFC PNs enhanced fear behaviour. a, Timeline scheme for the 
optogenetic stimulation protocol on day 3 with stimulation targeting the 
descending phase of 4 Hz oscillation cycle (top) and day 4 with stimulation 
targeting the ascending phase of 4 Hz oscillation cycle (bottom). 

b, d, Comparison of freezing levels between CS* 1 and CS* 2-4 for epochs 
10s preceding, during and 30s after CS* during optogenetic stimulation 
of PV-IRES-Cre mice infected within the dmPFC with ArchT (pre-CS*, 
CS* and post-CS* respectively) on day 3 (b) and day 4 (d). On day 3, 
descending phase stimulation after CS* 1 had no effect on freezing levels 
during subsequent CS* epochs before, during or after CS* (n=5 ArchT 
mice, repeated-measures two-way ANOVA, factor 1: CS* 1 versus CSt 
2-4, P > 0.05; factor 2: epoch, P < 0.05; Fy x Fy, P > 0.05; Student- 
Newman-Keuls post-hoc test within factor 1, **P < 0.01 compared with 
both pre- and post-CS* epochs, *P < 0.05 comparing pre-CS with CSt 
epochs). On day 4, ascending phase stimulation after CS* 1 had no effect 
on freezing levels during subsequent CS* epochs before, during or after 
CS* (n=5 ArchT mice, repeated-measures two-way ANOVA, factor 1: 


CS* 1 versus CS* 2-4, P > 0.05; factor 2: epoch, P < 0.05; F, x Fh, 

P< 0.05; Student-Newman-Keuls post-hoc test within factor 1, 

** P< (0.001 compared with both CS* and post-CS* epochs, *P < 0.05 
comparing pre-CS* with CS* epochs). ¢, e, Left: time-resolved changes in 
freezing behaviour during light-mediated inhibition of dmPFC neuronal 
assemblies in the descending (c) or ascending (e) phase of the 4 Hz 
oscillation in PV-IRES-Cre mice infected within the dmPFC with ArchT 
(descending phase: first CSt, n = 5; ascending phase: first CSt, n=5) or 
GFP (descending phase: first CS*, n = 7; ascending phase: first CSt, n = 4; 
descending phase: two-way ANOVA, factor 1: ArchT versus GFP, P > 0.05; 
factor 2: time, P > 0.05; F; x F2, P > 0.05; ascending phase: two-way 
ANOVA, factor 1: ArchT versus GFP, P < 0.01; factor 2: time, P < 0.01; 

F, x Fy, P < 0.01). Shaded area represents the period during which the 
light stimulation was applied. Right: averaged freezing behaviour during 
the stimulation in the descending (c) (ArchT n=5; GFP n=7, Mann- 
Whitney test, ArchT versus GFP, P > 0.05) or ascending phase (e) (ArchT 
n=5; GFP n=4, Mann-Whitney test, ArchT versus GFP, *P < 0.05). 
Error bars, mean + s.e.m. 
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phase distribution of ONs provides downstream structures with a over a cycle are increased. This does not affect SNR and freezing remains 
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expression. b, During optogenetic PN inhibition this SNR is altered. Top: emitter whose signal transmission conditions the SNR for the receiver and 
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cycle is shut down. This dramatically reduces SNR and prevents freezing phase specificity for dmPFC neuronal assemblies shows that SNR can be 
expression. Bottom: when stimulating in the descending phase, AN manipulated to either. 
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Unexpected role of interferon--‘ in regulating 
neuronal connectivity and social behaviour 


Anthony J. Filiano!’, Yang Xu’, Nicholas J. Tustison*, Rachel L. Marsh!?, Wendy Baker!?, Igor Smirnov!?, 


1,2 


Christopher C. Overall!?, Sachin P. Gadani!?>-°, Stephen D. Turner’, Zhiping Weng®, Sayeda Najamussahar Peerzade’, 
Hao Chen, Kevin S. Lee*>°, Michael M. Scott®!°, Mark P. Beenhakker®!”, Vladimir Litvak** & Jonathan Kipnis!?>-°* 


Immune dysfunction is commonly associated with several 
neurological and mental disorders. Although the mechanisms by 
which peripheral immunity may influence neuronal function are 
largely unknown, recent findings implicate meningeal immunity 
influencing behaviour, such as spatial learning and memory!. 
Here we show that meningeal immunity is also critical for social 
behaviour; mice deficient in adaptive immunity exhibit social 
deficits and hyper-connectivity of fronto-cortical brain regions. 
Associations between rodent transcriptomes from brain and cellular 
transcriptomes in response to T-cell-derived cytokines suggest 
a strong interaction between social behaviour and interferon-y 
(IFN-+y)-driven responses. Concordantly, we demonstrate that 
inhibitory neurons respond to IFN-y and increase GABAergic 
(\-aminobutyric-acid) currents in projection neurons, suggesting 
that IFN-+ is a molecular link between meningeal immunity and 
neural circuits recruited for social behaviour. Meta-analysis of the 
transcriptomes of a range of organisms reveals that rodents, fish, 
and flies elevate IFN-+/JAK-STAT-dependent gene signatures 
in a social context, suggesting that the IFN-+ signalling pathway 
could mediate a co-evolutionary link between social/aggregation 
behaviour and an efficient anti-pathogen response. This study 
implicates adaptive immune dysfunction, in particular IFN-y, 
in disorders characterized by social dysfunction and suggests a 
co-evolutionary link between social behaviour and an anti-pathogen 
immune response driven by IFN-y signalling. 

Social behaviour is beneficial for many processes critical to the 
survival of an organism, including foraging, protection, breeding, and, 
for higher-order species, mental health”. Social dysfunction manifests 
in several neurological and mental disorders such as autism spectrum 
disorder, frontotemporal dementia, and schizophrenia, among 
others‘. Likewise, imbalance of cytokines, a disparity of T-cell subsets, 
and overall immune dysfunction is often associated with the above- 
mentioned disorders*’. However, the fundamental mechanism(s) by 
which dysfunctional immunity may interfere with neural circuits and 
contribute to behavioural deficits remain unclear. 

To test whether adaptive immunity is necessary for normal social 
behaviour, we tested SCID mice (deficient in adaptive immunity) 
using the three-chamber sociability assay* (Extended Data Fig. 1a). 
This assay quantifies the preference of a mouse for investigating a novel 
mouse versus an object, and has been used to identify deficits in mul- 
tiple mouse models of disorders that present with social dysfunction’. 
Unlike wild-type mice, SCID mice lacked social preference for a mouse 
over an object (Fig. la). Importantly, SCID mice did not show anxiety, 


motor, or olfactory deficits (Extended Data Fig. 1b-j). We confirmed 
that SCID mice have social deficits by analysing social interactions in 
a home cage (Extended Data Fig. 1k). To test whether social deficits 
were reversible, we repopulated 4-week-old SCID mice with wild-type 
lymphocytes (Extended Data Fig. 11-n) and measured social behaviour 
4 weeks after transfer. SCID mice repopulated with lymphocytes, unlike 
those injected with the vehicle, showed social preference indistinguish- 
able from wild-type mice (Fig. 1b). 

Recent clinical findings indicate that disturbed circuit homeostasis, 
resulting in hyper-connectivity, is a feature of children with autism 
spectrum disorder!®. Imaging studies using task-free resting-state {MRI 
(rsf{MRI), revealed hyper-connectivity among frontal cortical nodes in 
patients with autism spectrum disorder’’. Disturbances in resting state 
connectivity are also observed in mice with social deficits’”. rsfMRI is 
an unbiased technique used to assess synchrony between brain regions 
over time by comparing spontaneous fluctuations in blood oxygenation 
level-dependent (BOLD) signals!’. To assess the influence of adaptive 
immunity on functional connectivity, we analysed resting-state BOLD 
signals from wild-type and SCID mice (Extended Data Fig. 2a). SCID 
mice exhibited hyper-connectivity between multiple frontal and 
insular regions (Fig. 1c, d, Extended Data Fig. 2b and Supplementary 
Table 1) implicated in social behaviour and autism spectrum disorder. 
Notably, repopulating SCID mice with lymphocytes rescued aberrant 
hyper-connectivity observed in vehicle-treated SCID controls (Fig. 1c, d 
and Extended Data Fig. 2b). Interestingly, other functionally connected 
regions, not directly implicated in social function, such as interhemi- 
spheric connectivity between motor and somatosensory cortex, were 
not affected by a deficiency in adaptive immunity (Supplementary 
Table 1). Using another approach to analyse neuronal activation in 
a task-based system, we demonstrated that SCID mice exposed to a 
social stimulus exhibited hyper-responsiveness in the prefrontal cortex 
(PFC; increased number of c-fos* cells in PEC; Fig. le, f) but not the 
hippocampus (Extended Data Fig. 2c). 

We previously demonstrated that T cells influence learning behaviour 
and exert their beneficial effects presumably from the meninges! !*. 
To address the role of meningeal T cells in social behaviour, 
we decreased the extravasation of T cells into the meninges of wild-type 
mice using antibodies against VLA4 (ref. 15), an integrin expressed on 
T cells (among other immune cells) required for CNS homing. 
Partial elimination of T cells from meninges (Extended Data Fig. 3) 
was sufficient to cause a loss in social preference (Fig. 1g). Despite 
their proximity to the brain, meningeal T cells do not enter the brain 
parenchyma, suggesting their effect is mediated by soluble factors. To 
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Figure 1 | Meningeal T-cell compartment is necessary for supporting 
neuronal connectivity and social behaviour. a, Wild-type mice 

exhibit social preference absent in SCID mice (ANOVA for genotype 
(Fi,26) = 6.370, P= 0.0181; n= 14 mice per group; **P < 0.01, Sidak’s 
post hoc test; pooled two independent experiments). b, Repopulating the 
adaptive immune system in SCID mice restored normal social behaviour 
(n=17, 16, 15 mice per group; ANOVA for genotype (F2,45) = 8.282, 
P=0.0009 and interaction (F245) = 9.146, P=0.0005; ***P < 0.001; 

**P <(.01, Sidak’s post hoc test; pooled three independent experiments). 
c, Correlation matrices from wild-type, SCID, and repopulated (Repop.) 
SCID mice were generated by rsfMRI. L, left; R, right; FrA, frontal 
association area; PrL, prelimbic cortex; Ins, insula; OrbC, orbital cortex. 
d, Correlation values from rsfMRI. Box and whisker plots extend to the 
25th and 75th percentiles with the centre-line showing the mean. Whiskers 
represent the minimum and maximum data points (n = 8, 9, 4 mice 

per group; ANOVA < 0.05; *P < 0.05, Sidak’s post hoc test; pooled two 
independent experiments). e, Immunohistochemistry of c-fos in the PFC. 


identify which T-cell-mediated pathways are involved in regulating 
social behaviour, we used gene set enrichment analysis (GSEA) to 
search for T-cell-mediated response signatures (IFN-y, IL-4/IL-13, 
IL-17, Il-10, TGF-) in 41 transcriptomes from mouse and rat brain 
cortices. GSEA assesses whether the expression of a previously defined 
group of related genes is enriched in one biological state. In this case, 
we used GSEA to identify which cytokine-induced response signatures 
were enriched in the transcriptomes of mice and rats exposed to 
different stimuli. The stimuli included social aggregation, sleep 
deprivation, stress, psychostimulants, antidepressants, anticonvulsants, 
and antipsychotics. Transcriptomes from cortices of animals exposed 
to social aggregation and psychostimulants were enriched for IFN- 
regulated genes (Fig. 1h, Extended Data Fig. 4 and Supplementary 
Tables 2 and 3). 

A substantial number of meningeal T cells are capable of expressing 
IFN-7 (41.95% + 6.34 of TCR* cells; Extended Data Fig. 5a) and recent 
work has proposed a role for IFN-+ in T-cell trafficking into meningeal 
spaces!®, To assess the potential role of IFN-+ in mediating the influence 
of T cells on social behaviour, we first examined the social behaviour 
of IFN-7\-deficient mice and determined that they had social deficits 
(Fig. 2a). Importantly, IFN-y-deficient mice did not show anxiety 
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f, Elevated numbers of c-fos* cells in the prefrontal and orbital cortices 

of SCID compared with wild-type mice (n = 9;10 mice per group; 

**P < 0.01; *P < 0.05, t-test; single experiment). g, Acute partial depletion 
of meningeal T cells caused social deficits (n = 12, 13 mice per group; 
ANOVA for interaction (F),23) = 7.900, P< 0.01; ***P < 0.001, Sidak’s post 
hoc test; pooled two independent experiments). h, Circos plot showing the 
connectivity map derived from the pairwise comparison of transcriptome 
data sets. The representations of IFN-y, IL-4/IL-13, IL-17, and IL-10/ 
TGEF-B data set connectivity are shown in orange, green, blue, and purple, 
respectively. Each line represents a pairwise data set overlap, which was 
determined using GSEA analysis and filtered by P< 0.05 and normalized 
enrichment score (NES) > 1.5. See Extended Data Fig. 5 for labels. Data 
from the three-chamber test (a, b, g) were analysed by applying a two- 

way ANOVA for social behaviour and genotype/treatment, followed by 
Sidak’s post hoc test. Bars, mean times investigating + s.e.m. Antidepress., 
antidepressants; anticonvuls., anticonvulsants; antipsych., antipsychotics. 


or motor deficits (Extended Data Fig. 5b-g). Similar to SCID mice, 
IFN-7-deficient mice also exhibited aberrant hyper-connectivity in 
fronto-cortical/insular regions (Fig. 2b, cand Supplementary Table 1). 
Although repopulating SCID mice with lymphocytes from wild- 
type mice restored a social preference, repopulating SCID mice with 
lymphocytes from Ifng~/~ mice did not have such an effect (Extended 
Data Fig. 6a). Remarkably, a single injection of recombinant IFN-7 into 
the cerebrospinal fluid (CSF) of Ifng-/~ mice was sufficient to restore 
their social preference when tested 24 h after injection (Fig. 2d) and 
reduce overall hyper-connectivity in the PFC (Extended Data Fig. 6b). 
To further validate a role for IFN-7y signalling in social behaviour, 
we tested mice deficient for the IFN-+ receptor (Ifngr1~'~ mice) and 
found that they hada similar social deficit as observed in Ifng-'~ mice 
(Extended Data Fig. 6c), which, as expected, was not rescued by injecting 
recombinant IFN-7 into the CSF (Extended Data Fig. 6d). On the basis 
of our previous demonstration of a role for IL-4 produced by meningeal 
T cells in spatial learning behaviour!, we assessed whether deficiency 
in IL-4 would also result in social deficits. IL-4-deficient mice did not 
demonstrate social deficits; in fact, they spent more time investigat- 
ing a novel mouse than a novel object compared with wild-type mice 
(Extended Data Fig. 6e). 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


a Social Ifng~~ mice b Correlation matrix Cc rsfMRI d Social Ifng~- mice 
LFA LPrL Lins LPrh 
250 0.8 300 
= d2<<oQne dH<fogue s eae pois owe Lins ae ** [Control 
> 200 a@awnwooOfst qakwoorts Zz o/- - = Wl IFN-y 
2 Sesoescse seseseso ey 2 NS 
% 450 L PrL oa Tog 200, © 
2 R Prt co Da 
8 100 a FA | 0.5 2 02 8 
E OC rs “3 O = 100 
ROC a 0 = 
£ 20 Lins 8 0.0 z 
i R Ins 0.5 i= 
0 ifng** ifng-- -0.2 L$ —_____ ees 
Bebe eee? gi 3% 
ssessexsss =6 56 
e RNA FISH f Surface expression Q Social /fngr1™': AAV-Syn 
PSD-95 __CD11B NeuNt 
ae 205 7 25 200. mi Cre- 
. ——- g 25TngT oa S . eS ee NS 
10 & 18 tngrt~ 15. Eh | = = — WiCre* 
fi 3 jo) | ? 450 
19 5s 5) 8 WA = 
1084 9 oL_L 2 
1024 sy CD11B+ 8 100 
4) to 2 40 al 60 { is 
) 5 30 \! 40 {| o 50 
2! 400 2 20 {i 20. {| E 
10° 10" 102 103 104 108108 © 10 y \\ 6 }\ = ‘5 
———— on Oo 
CD11B IFNGR1 IFNGR2 g 8 g 8 
£6 26 
hh Social Stat?™''::vgat-Cre i Biccuculine Tonic inhibition k Seizure latency Social Ifng’~ mice 
a Cre“ = 0 25 250, BSaline 
= = S 
o i Cre* ro — — Moz x, 
E 200 NS 100 pa = -100 = 20 “ 2 200 ~ cs 
7 é seizure severity = 
2 150 Tomin £ £45 et & 150] 
Q 3 -200 fy qo 3 
E 100 ped ahs 2 8 10 F 2 fa 100 
2 50 3 7300 45 g 2 50 
F - OCHINy f= 
0 -400 0 Treatment 0 
25 84 150 pa Pre Post 012345678 83 B38 
23 33 1 min Time ‘ 35 ae 
2828 Seizure severity 28 28 


Figure 2 | IFN-y supports proper neural connectivity and social 
behaviour. a, Tfng mice exhibit social deficits (n = 16, 12 mice per 
group; ANOVA for genotype (F1,52) = 8.327, P< 0.01; **P< 0.01, Sidak’s 
post hoc test; pooled two independent experiments). b, Correlation matrix 
from Ifng*!* and Ifng-/~ mice. c, Box and whisker plots of correlation 
values (n = 8 mice per group; *P < 0.05; # P= 0.06, t-test; repeated twice. 
d, A single CSF injection of IFN-7 (20 ng) was sufficient to rescue social 
deficits in Ifng-'~ mice 24h after injection (n = 14, 11 mice per group; 
ANOVA for interaction (Fj,46) = 10.22 P< 0.01; ***P < 0.001, Sidak’s post 
hoc test; pooled two independent experiments). e, Expression of IFN-+ 
receptor subunit mRNA by fluorescent in situ hybridization in slices from 
mouse PFC. RNA probes and corresponding colours: left, psd95-blue 
(neurons); right, CD11B-blue (microglia); top, IFNGR1-red; bottom, 
IFGR2-red. Yellow arrowheads denote co-localization. f, Expression of 
IFN-7 receptor subunit protein by flow cytometry. Cells were gated on 
Hoechst+/live/single then neurons and microglia were gated on NeuN and 
CD11B, respectively. Ifngr1~/~ mice and no primary antibody for IFNGR2 
were included as negative controls. g, Deleting Ifngr1 from neurons in 

the PFC caused social deficits. Ifngr1"" mice were injected with AAV- 
Syn-CRE-GFP into the PFC and tested for social behaviour 3 weeks after 


To determine which cell types in the brain respond to IFN-7, we 
analysed mouse PFC for the expression of IFN-7y receptor subunits 1 
and 2 and found that both neurons and microglia express mRNA and 
protein for R1 and R2 subunits of the IFN-7y receptor (Fig. 2e, f and 
Extended Data Fig. 7). Microglia are CNS-resident macrophages and are 
known to express the IFN-y receptor'”. However, genetically deleting 
STAT1, the signalling molecule downstream of the IFN-y receptor, 
from microglia (and other cells of myeloid origin), did not disturb 
normal social preference (Extended Data Fig. 8). These results led us to 
focus on neuronal responses to IFN-7as they relate to social behaviour. 

To assess a role for IFN-1 in neuronal signalling, we deleted Ifngr1 in 
PFC neurons via adeno-associated virus (AAV) delivery of Cre recombi- 
nase under the Synapsin I promoter (Extended Data Fig. 9). Attenuating 
IFN-7 signalling in PFC neurons was sufficient to alter mouse 
behaviour in a three-chamber social task and result in a lack of social 
preference (Fig. 2g), reinforcing the importance of IFN-y signalling on 
neurons for social behaviour. Injecting recombinant IFN-7 into the CSF 
activated layer I neocortical neurons as assessed by c-fos immunore- 
activity (Extended Data Fig. 10a, b). These neurons are almost entirely 
inhibitory'®, suggesting that IFN-y may drive regional inhibition of 
circuits by directly activating layer I inhibitory neurons located in close 


injection (n= 11, 12 mice per group; ANOVA for genotype (Fj,21) = 10.62, 
P<0.01; *P < 0.05, Sidak’s post hoc test; pooled three independent 
experiments). h, Vgat(*::Stat1"" mice exhibit social deficits (n= 10, 11 
mice per group; ANOVA for interaction (Fj,19) = 10.30, P< 0.01; 

*** P< 0.001, Sidak’s post hoc test; pooled three independent 
experiments). i, Layer I/II neurons in slices from wild-type mice are held 
under tonic GABAergic inhibition (top), which is blocked by the GABA-A 
receptor antagonist bicuculline. IFN-y increases tonic GABAergic 
inhibitory current (n= 11 cells from four mice; bottom). j, Holding 
current before and during IFN-y (P= 0.01 t-test). k, IFN-y increased 
latency to reach each seizure stage (n = 6 mice per group; ANOVA with 
repeated measures <0.001; ***P < 0.001, Sidak’s post hoc test) and 

(inset) reduced the maximum severity of seizures (***P < 0.001, t-test; 
repeated twice). 1, Diazepam rescued social deficits in Ifng-/~ mice (n= 12 
mice per group; ANOVA for interaction (F,,72) = 9.204 < 0.01; **P<0.01, 
Sidak’s post hoc test; repeated twice). Data from the three-chamber test 

(a, d, g, h, i) were analysed by applying a two-way ANOVA for social 
behaviour and genotype/treatment, followed by a Sidak’s post hoc test. 
Bars represent average mean times investigating + s.e.m. All experiments 
were repeated at least once. 


proximity to the brain surface and CSF. To investigate mechanisms 
downstream of the IFN-7y receptor, we used Veat'"::Stat f/f 
mice. Deletion of STAT1 from GABAergic inhibitory neurons was 
sufficient to induce deficits in social behaviour (Fig. 2h), suggesting that 
the IFN-7y may be signalling through inhibitory neurons. 

To directly assess if IFN-7 can drive inhibitory tone in the PFC, 
we measured inhibitory currents in layer II/III pyramidal cells from 
acutely prepared brain slices from wild-type mice. In addition to 
receiving phasic inhibitory synaptic input, these cells are also held 
under a tonic GABAergic current that serves to hyperpolarize their 
resting membrane potential (Fig. 2i). Tonic GABAergic currents are 
extrasynaptic and can yield long-lasting network inhibition’®. We 
observed that IFN-7y augmented tonic current (Fig. 2i, ) and Extended 
Data Fig. 10c, d), suggestive of elevated levels of ambient GABA dur- 
ing application of IFN-y. Deleting the IFN-7 receptor from inhibitory 
neurons (Vgat::Ifngr “" mice) prevented IFN-y from augmenting 
tonic inhibitory current (Extended Data Fig. 10c). Given that IFN-y 
promotes inhibitory tone, we tested whether IFN-7y could prevent 
aberrant neural discharges by injecting IFN-7y into the CSF and then 
chemically inducing seizures with the GABA type A receptor antag- 
onist pentylenetetrazole (PTZ). Mice injected with IFN-7 were less 
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Figure 3 | Over-representation of IFN-+ transcriptional signature 
genes in social behaviour-associated brain transcriptomes of rat, 
mouse, zebrafish, and Drosophila. a—c, GSEA plots demonstrate the 
over-representation of IFN-7 transcriptional signatures (derived from 
Molecular Signature Database C2, GSE33057, or from ref. 30) in brain 
transcriptomes of (a) mice and (b) rats subjected to social or isolated 
housing and in brain transcriptomes of (c) domesticated zebrafish 
compared with a wild zebrafish strain. d, Over-representation of 
JAK/STAT pathway transcriptional signature genes (derived from 
GSE2828) in head transcriptomes of flies selected for low aggressive 


susceptible to PTZ-induced seizures: IFN-7 delayed seizure onset 
and lowered seizure severity (Fig. 2k). Further, to test whether over- 
excitation causes social deficits in IFN-7-deficient mice, we treated 
Ifng-'~ mice with diazepam to augment GABAergic transmission” 
Diazepam successfully rescued social behaviour of Ifng~'~ mice, 
similar to the effect observed with recombinant IFN-7y treatment 
(Fig. 21), suggesting that social deficits, caused by a deficiency in IFN-1, 
may arise from inadequate control of GABAergic inhibition by IFN-». 

It is intriguing that IFN-+, predominately thought of as an anti- 
pathogen cytokine, can play such a profound role in maintaining proper 
social function. Since social behaviour is crucial for the survival of a 
species and aggregation increases the likeliness of spreading pathogens, 
we hypothesized that there was co-evolutionary pressure to increase 
an anti-pathogen response as sociability increased, and that the IFN-y 
pathway may have influenced this co-evolution. To test this hypothesis, 
we analysed metadata of publically available transcriptomes from mul- 
tiple organisms including the rat, mouse, zebrafish, and fruit fly. Using 
GSEA, we determined that transcripts from social rodents (acutely 
group-housed) are enriched for an IFN-7 responsive gene signature 
(Fig. 3a, b and Supplementary Tables 3-7). Conversely, rodents that 
experienced social isolation demonstrated a dramatic loss of the IFN-y 
responsive gene signature (Supplementary Tables 3-7). Zebrafish and 
flies showed a similar association between anti-pathogen and social 
responses (Fig. 3c, d). We observed that immune response programs 
were highly enriched in the brain transcriptomes of flies selected for low 
aggressiveness traits (a physiological correlate for socially experienced 
flies”!; Fig. 3d and Supplementary Tables 3-7). We next analysed the 
promoters of these highly upregulated social genes and found them to 
be enriched for STAT1 transcription factor binding motifs (Fig. 3a—d). 
These data suggest that, even in the absence of infection, an IFN-y gene 
signature is upregulated in aggregated organisms. This is consistent 
with an interaction between social behaviour and the anti-pathogen 
response, a dynamic that could be mediated by the IFN-y pathway. 
Since low-aggressive flies upregulate genes in the JAK/STAT pathway 
(canonically downstream of IFN-y receptors in higher species), yet 
lack IFN-7 or T cells, it is intriguing to speculate that T-cell-derived 
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behaviour (behavioural readout for social behaviour in flies (see Methods 
section ‘Meta-data analysis for more details)). Genes are ranked into an 
ordered list according to their differential expression. The middle part of 
the plot is a bar code demonstrating the distribution of genes in the 
IFN-* transcriptional signature gene set against the ranked list of genes. 
The list on the right shows the top genes in the leading-edge subset. 
Promoter regions of these genes were scanned for transcription factor 
binding sites using MEME suite. MEME output demonstrates significant 
STAT transcription factor binding site enrichment in cis-regulatory 
regions of leading-edge genes upregulated in a social context. 


IFN-7 may have evolved in higher species to more efficiently regulate 
an anti-pathogen response during increased aggregation of individuals. 

Our results reveal a novel role for meningeal immunity in regulating 
neural activity and social behaviour through IFN-y. The role of immune 
molecules has been previously shown to control brain development and 
function”, anda role for cytokines in influencing behaviour has been 
proposed, primarily in the context of sickness behaviour and pain?””’. 
These examples, however, have predominately focused on peripheral 
nerves and non-neuronal targets. Here we show that CNS neurons 
directly respond to IFN-7 derived from meningeal T cells to elevate 
tonic GABAergic inhibition and prevent aberrant hyper-excitability 
in the PFC. These data suggest that social deficits in numerous neu- 
rological and psychiatric disorders may result from impaired circuitry 
homeostasis derived from dysfunctional immunity. On the basis of our 
findings, it is also plausible that subtle homeostatic changes in menin- 
geal immunity may also contribute to modulating neuronal circuits 
that are responsible for our everyday behaviours and personality. Given 
this communication between immunity and neuronal circuits”, it is 
intriguing to hypothesize that these pathways might be vulnerable to 
manipulation by fast-evolving pathogens. Improved comprehension of 
these pathways could improve our understanding of the aetiology of 
neurodevelopmental and neuropsychiatric disorders associated with 
maternal infections and/or aberrant inflammation and may result in 
the development of new therapeutic targets. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Mice. All mice (C57BL/6) were either bred in-house or purchased from the Jackson 
Laboratory. For each individual experiment, the control mice were obtained from 
the same institution as test mice. In the case that mice were purchased, they were 
maintained for at least 1 week to habituate before manipulation/experimentation. 
When possible, mice used for experiments were littermates. These include all elec- 
trophysiology experiments, all cohorts using Stat" mice, experiments analysing 
induced seizures, experiments counting c-fos* neurons, and experiments using 
Il4-'~ mice. Experiments assessing the social behaviour of SCID mice using the 
three-chamber assay included mice bred in-house and mice purchased from the 
Jackson Laboratory. All other experiments used mice purchased from the Jackson 
Laboratory. Experimental groups were blinded and randomly assigned before the 
start of experimentation and remained blinded until all data were collected. Mice 
were housed under standard 12h light/dark cycle conditions in rooms equipped 
with control for temperature and humidity. Unless stated otherwise, male mice 
were tested at 8-10 weeks of age. Sample sizes were chosen on the basis of a power 
analysis using estimates from previously published experiments. All experiments 
were approved by the Institutional Animal Care and Use Committee of the 
University of Virginia. 

Behaviour. For cohorts tested with multiple behavioural assays, the elevated 
plus-maze was performed first and then followed by the open field before any other 
assay. Before all experiments, mice were transported to the behaviour room and 
given 1h to habituate. All behavioural testing was conducted during daylight hours. 
Elevated plus-maze. Mice were placed into the centre hub and allowed to explore 
the plus-maze for 10 min. Video tracking software (CleverSys) was used to quantify 
time spent in the open arms. 

Open field. Mice were placed into the open field (35 cm x 35cm) and allowed to 
explore for 15 min. Total distance and time spent in the centre (23cm x 23cm) 
were quantified using video tracking software (CleverSys). 

Rotarod. Mice were placed on an accelerating rotarod (MedAssociates) that 
accelerated from 4.0 to 40 rpm over 5 min. Infrared beams were used to quantify 
the latency of a mouse to fall of the rod. Mice were given six trials with a 4-h break 
between trials 3 and 4. 

Three-chamber sociability assay. The three-chamber sociability test was 
conducted as previously described*". Briefly, mice were transported to the testing 
room and habituated for at least 1h. The room was maintained in dim light and 
a white noise generator used to mitigate any unforeseen noises. Test boxes were 
fabricated, in-house, by a machine shop. Test mice were placed in the centre 
chamber with the two outer chambers containing empty wire cages (Spectrum 
Diversified Designs) and allowed to explore for 10 min (habituation phase). After 
the habituation phase, mice were returned to the centre chamber. A novel mouse 
(an 8- to 10-week-old male C57BL/6J, ~18-22g) was placed under one cup and 
an object placed under the other. Before testing, the novel mouse was habituated 
to the cup by being placed under the cup for 10 min, three times a day for 5 days. 
Mice were allowed to explore for an additional 10 min (social phase). A video 
tracking system (CleverSys) was used to quantify the time spent around each target. 
For three-chamber experiments comparing social behaviour between wild-type 
and SCID mice, data collected using males and females were combined because 
no significant effects were found between genders. When females were tested, 
juvenile C57BL/6] male (approximately 4 weeks old) mice were used as novel 
demonstrators. 

Novel/social environment. Mice were single housed and maintained unmanipu- 
lated for 5 days, then placed in a novel/social environment with a mouse (opposite 
sex and identical scid genotype) under video surveillance as previously described*!. 
After 2h, mice were euthanized and prepared for immunohistochemistry as 
described below. The time spent interacting was measured by a blinded observer 
for 5 min at the 30, 60, and 90 min post-introduction time points. 

Resting-state fMRI. The protocol for rsfMRI was adapted from Zhan et al.!?. Mice 
were maintained under light anaesthesia with (1-1.25%) isoflurane and images 
were acquired on a 7.0 T MRI Clinscan system (Bruker, Etlingen, Germany) using 
a 30-mm inner-diameter mouse whole-body radiofrequency coil. High-resolution 
structural images were acquired by collecting 16 0.7-mm-thick coronal slices using 
TR/TE 5,500/65 ms and an 180° flip angle. A BOLD rsfMRI time series of 16 
0.7-mm-thick coronal slices was collected using TR/TE 4,000/17 ms and a 60° flip 
angle. For analysis, a structural template*” was custom labelled using The Mouse 
Brain by Paxinos and Franklin as reference (Extended Data Fig. 3). The non-rigid 
transformation between the template and each individual mouse was estimated 
using the open source Advanced Normalization Tools (ANTs) package** that was 
then used to propagate the regional labels to each subject. Cleaning of the bold 
fMRI data was performed using tools available in ANTSR**—a statistical and 
visualization interface between ANTs and the R statistical project. {MRI preproc- 
essing consisted of motion correction*’, bandpass filtering (frequency =0.002-0.1), 
and CompCor estimation*®. The correlation matrix was determined from the 


clean fMRI data using the regions labels. To determine whether the functional 
connectivity between sample groups was significantly different, we tested for the 
equality of their corresponding correlation matrices. First, an aggregate correlation 
matrix was constructed for each group by calculating the median value for each 
connection, and then the aggregate matrices were compared using the Jennrich 
test*”, as implemented in the cortest.jennrich function in the R psych package. 
To create a functional connectivity network for a sample group, a correlation 
threshold was applied to the connections between regions of interest. If a 
connection strength was above the threshold, it was kept as an edge in the network, 
otherwise it was discarded. When comparing networks from multiple sample 
groups, the threshold was determined by calculating the maximum threshold that 
left one of the networks connected (that is, it was possible to reach any node in the 
network from any other node). 

Immunohistochemistry. Mice were killed under Euthasol then transcardially 
perfused with PBS with heparin. Brains were removed and drop fixed in 4% PFA 
for 48h. After fixation, brains were washed with PBS, cryoprotected with 30% 
sucrose, then frozen in O.C.T. compound (Sakura Finetek) and sliced (401M) with 
a cryostat (Leica). Free-floating sections were maintained in PBS + Azide (0.02%) 
until further processing. Immunohistochemistry for c-fos (1:1,000 dilution; 
Millipore) on free-floating sections was performed as previously described**. 
Depletion of meningeal T cells. A rat monoclonal antibody to murine VLA4 
(clone PS/2) was affinity purified from hybridoma supernatants and was used 
with the permission of K. Ley (La Jolla Institute of Allergy and Immunology, San 
Diego, CA). Mice were given two separate injections (intraperitoneal; 0.2 mg in 
saline per mouse), 4 days apart, of either anti- VLA4 or rat anti- HRP for control 
(clone HRPN; BioXcell), then tested 24h after final injection. 

Dissection of meninges and flow cytometry. Meninges were dissected as previ- 
ously described". Briefly, after killing and perfusing, skulls caps were removed by 
making an incision along the parietal and squamosal bones. The meninges were 
removed from the internal side of the skull cap and gently pressed through a 70 1m 
nylon mesh cell strainer with sterile plastic plunger (BD Biosciences) to isolate 
a single cell suspension. Cells were then centrifuged at 300g at 4°C for 10 min, 
resuspended in cold FACS buffer (pH 7.4; 0.1M PBS; 1mM EDTA; 1% BSA), 
and stained for extracellular markers using the following antibodies at a 1:200 
dilution: CD45 PerCP-Cyanine5.5 (eBioscience), TCR BV510 (BD Bioscience), 
CD4 FITC (eBioscience), L/D Zombie NIR (BioLegend). To measure intracellular 
IFN-\, single-cell isolates from meninges were maintained in T-cell isolation buffer 
(RPMI + 2% FBS, 2mM L-glutamine, 1 mM sodium pyruvate, 10 mM HEPES, 
1x non-essential amino acids, and 1x Antibiotic-Antimycotic (Thermo Fisher) 
and stimulated with PMA/ionomycin (Cell Stimulation Cocktail - eBioscience) 
+ 10,1g/ml brefeldin A at 37°C before extracellular staining as stated above. Cells 
were then permeabilized with Cytofix/Cytoperm (BD Biosciences) and stained 
with IFN-y APC (eBioscience; clone XMG1.2). 

To measure expression of IFN-7 receptors, brains were removed and placed 

in Neurobasal media containing 10% fetal bovine serum. The meninges were 
removed from the brain and the frontal cortex was micro-dissected under a 
dissection microscope. Using a 2 ml dounce homogenizer, brains were homog- 
enized in Neurobasal media with 50 U/ml Dnase I. The homogenate was passed 
through a 70j.m nylon filter and washed with cold FACS buffer. IFNGR were 
labelled with anti-IFNGR1 Biotin (BD Pharmingen; GR20) or rabbit anti-IFNGR2 
(Santa Cruz; M-20). Cells were washed with FACS buffer and incubated with 
FITC conjugated streptavidin or 488 conjugated chicken anti-rabbit. Next, cells 
were washed again then incubated with CD11B PE-Cyanine7 (eBioscience), 
L/D Zombie NIR (BioLegend), and Hoechst for 45 min. Cells were washed, then 
permeabilized and fixed with Cytofix/Cytoperm (BD Biosciences). After another 
wash with permeabilization wash buffer (PBS with 10% fetal bovine serum, 1% 
sodium azide, and 1% saponin; pH 7.4), cells were incubated overnight with NeuN 
PE (Millipore). Cells underwent a final wash and again passed through a 701m 
nylon filter. Samples were run on a flow cytometer Gallios (Beckman Coulter) then 
analysed using FlowJo software (Treestar). 
SCID repopulation. SCID mice were repopulated with cells from spleen and 
lymph nodes (axillary, brachial, cervical, inguinal, and lumbar). Spleen and lymph 
nodes were collected from a 3- to 4-week-old donor and passed through a 70,1m 
nylon mesh cell strainer with a sterile plastic plunger. ACK buffer was used to lyse 
red blood cells before washing with saline. Cells were counted on an automated 
cell counter (Nexcelom) and injected (intravenously) at 5 x 10° cells in 250 ul of 
saline (control mice were injected with 250] of saline only). For injections, the 
animal technician was blinded to the genotype of the mice and the content of the 
injection. Thus, all groups were handled identically. Mice aged 3-4 weeks were 
carefully placed into a tail vein injection platform. Their tails were briefly warmed 
using a heating pad and saline with cells or saline alone was slowly injected into 
the tail vein using a 28-gauge needle. After the injection, mice were returned to 
their home cage. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


IFN-7 injections. Mice were anaesthetized with a ketamine/xylazine (ketamine 
(100 mg/kg) and xylazine (10 mg/kg)) injection (intraperitoneal) or isoflurane 
(2%), then placed into a stereotaxic frame with the head at an approximately 
45° angle. The skin above the cisterna magna was cleaned and sanitized before 
a 1 cm incision was made. The underlying muscles were separated with forceps, 
retracted, and a small Hamilton syringe (33-gauge) was used to slowly inject 1 il 
of saline or IFN-y (20 ng; eBioscience) into the cisterna magna. After injection the 
syringe was held in place for 5 min to avoid back-flow of CSE. After the syringe was 
removed, muscles were put back in place and skin was sutured. Mice were placed 
on a heating pad and given ketoprofen and baytril for recovery. 

Induced seizures. IFN-7 was injected into the CSF through the cisterna magna as 
described above, 24h before inducing seizures. Control mice were injected with the 
same volume of saline. To induce seizures, mice were injected with PTZ (40 mg/kg; 
intraperitoneally). After injection, mice were placed into an empty housing cage 
and recorded for video analysis. Seizures were analysed by a blinded observer using 
a behaviour scoring system previously published*’. 

Diazepam treatment. Diazepam (1.25 mg/kg) was delivered intraperitoneally for 
30 min before testing for social behaviour. 

Fluorescence in situ hybridization. Mice were euthanized then transcardially 
perused with PBS with heparin followed by 4% PFA. Brains were then removed 
and drop fixed in 4% PFA for 24h, frozen in OCT, and 121M sections were cut 
on a cryostat. Fluorescence in situ hybridization was performed using RNA ISH 
tissue assay kits (Affymetrix) following the manufacturer’s protocol. Tissues were 
treated with protease for 20 min at 40°C. Images at 63 x magnification were 
acquired on a Leica TCS SP8 confocal system (Leica Microsystems) using LAS 
AF Software. 

AAV delivery. AAV1.hSyn.HI.eGFP-Cre.WPRE.SV40 and AAV 1.hSyn.eGFP. 
WPRE.bGH were purchased from Penn Vector Core. Ifngr 1" mice were pur- 
chased from the Jackson Laboratory. After 1 week of habituation, mice were 
anaesthetized with 2% isoflurane and injected bilaterally with 2 x 10!° genome 
copies of AAV virus in 11] at stereotaxic coordinates +-2.5,1m bregma A/P, 0.25 um 
lateral, 1.25 um deep. 

Measuring inhibitory currents. Visualized whole-cell patch-clamp recordings 
were performed on layer II/II prefrontal cortical neurons prepared from acute 
brain slices (adult) using the protective recovery method*’. Recordings were 
performed in 34°C artificial cerebrospinal fluid (ACSF) containing (in mM) 
131.5 NaCl, 25 NaHCOs, 12 p-glucose, 2.5 KCI, 1.25 NaH2POq, 2 CaCh, and 
1 MgCl. ACSF also contained 3 mM kynurenic acid to block synaptic excitation 
and 2.5mM NO-711 to enhance tonic inhibition’. Slices were incubated in this 
ACSF for 5-10 min before placement in the recording chamber. The patch pipette 
solution with elevated chloride contained (in mM) 140 CsCl, 4NaCl, 1 MgCh, 
10 HEPES, 0.05 EGTA, 2 Mg-ATP, and 0.4Mg-GTP*!. Once recordings 
equilibrated, baseline holding current in ACSF was measured for 3.5 min, after 
which ACSF containing IFN-+ (20 pg/ml) was applied for 8.5 min and then washed. 
Data presented show the mean holding current during the last minute of control 
(ACSF) and drug (ACSF + IFN-1) conditions. 

RNA isolation and sequencing. Eight-week-old male mice were purchased from 
the Jackson Laboratory and housed in standard housing boxes with either four 
mice per cage or isolated for 6 days. Mice were euthanized as described above and 
the PFC was microdissected under a dissection microscope. RNA was isolated 
using an RNAeasy mini kit (Qiagen) and a cDNA library was generated with a 
TruSeq Stranded mRNA Library Prep Kit (Illumina) with Agencourt AMPure 
XP beads for PCR cleanup. Samples were loaded onto a NextSeq 500 High-output 
75 cycle cartridge and sequenced on a NextSeq 500 (Illumina). 

Transcriptome analysis. Raw FASTQ sequencing reads were chastity filtered 
to remove clusters having outlying intensity corresponding to bases other than 
the called base. Filtered reads were assessed for quality using FastQC’”. Reads 
were splice-aware aligned to the UCSC mm9 genome using STAR®, and reads 
overlapping UCSC mm49 gene regions were counted using featureCounts“. The 
DESeq2 Bioconductor package“ in the R statistical computing environment" was 
used for normalizing count data, performing exploratory data analysis, estimat- 
ing dispersion, and fitting a negative binomial model for each gene comparing 
the expression from the PFC of mice in a social environment versus isolation. 
After obtaining a list of differentially expressed genes, log(fold changes), and 
P values, Benjamini-Hochberg false discovery rate procedure was used to correct 
P values for multiple testing. A gene set enrichment analysis (GSEA) algorithm” 
was applied to identify the enrichment of transcriptional signatures and molecular 
pathways in PFC transcriptomes of mice exposed to group and isolation housing 
conditions. Four thousand seven hundred and twenty-six publicly available tran- 
scriptional signatures were obtained from the molecular Signature Database C2 
version 4.0, and GSEA was used to examine the distribution of these curated gene 
sets in lists of genes ordered according to differential expression between group 
and isolation housing conditions. We analysed statistics by evaluating nominal 
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P value and normalized enrichment score (NES) on the basis of 1,000 random 
sample permutations. 

Meta-data analysis. The custom-made IFN-7 and pathogen-induced transcrip- 
tional signatures (Supplementary Table 2) were generated by retrieving genes 
upregulated at least twofold following IFN-7 stimulation or pathogen infection. 
All custom signatures were derived from publicly available transcriptomes down- 
loaded from Gene Expression Omnibus (GEO). Specifically, mammalian IFN-7 
transcriptional signatures were derived from transcriptomes GSE33057, GSE19182, 
GSE36287, GSE9659, GSE1432, and GSE6353. Zebrafish IFN-7 transcriptional 
signature was used as described**. Drosophila pathogen-induced JAK/STAT- 
dependent transcriptional signatures were derived from transcriptomes GSE54833 
and GSE2828. 

A GSEA algorithm was applied to identify the enrichment of custom-made 
mammalian IFN-7 transcriptional signatures in the publicly available brain cortex 
transcriptomes of mice and rats exposed to social aggregation, sleep deprivation, 
stress, psychostimulants (DOI, cocaine, amphetamine, methamphetamine, 
methylphenidate, caffeine, nicotine, and modafinil), antipsychotics (olanzapine, 
haloperidol, and risperidone), anticonvulsants (levetiracetam, phenytoin, 
ethosuximide, and oxcarbazepine), and antidepressants (iproniazid, moclobemide, 
paroxetine, and phenelzine). In total, 41 transcriptomes were analysed as indicated 
in Supplementary Table 2. We analysed statistics by evaluating nominal P value 
and NES on the basis of 1,000 random sample permutations. 

Zebrafish are social fish that aggregate into shoals*?->*. Various zebrafish 
strains differ in their preference for social interaction and novelty, resembling the 
phenotypic variation of inbred mouse strains*>. Notably, domesticated zebrafish 
strains demonstrate higher social interaction and social novelty preference** °° 
compared with wild zebrafish strains. Therefore, a GSEA algorithm was used to 
identify the enrichment of zebrafish IFN-y transcriptional signature in the publicly 
available whole-brain transcriptomic profiles of behaviourally distinct strains of 
domesticated and wild zebrafish (Supplementary Table 7). The brain transcrip- 
tomic profiles of domesticated (Scientific Hatcheries (SH) and Transgenic Mosaic 1 
(TM1)) and wild (Nadia, Gaighata) zebrafish strains were derived from publicly 
available transcriptome data set GSE38729. We analysed statistics by evaluating 
nominal P value and NES on the basis of 1,000 random gene set permutations. 

Social interactions in Drosophila melanogaster flies play an important role in 
courtship, mating, egg-laying, circadian timing, food search, and even lifespan 
determination®”~. Notably, a number of studies have demonstrated that social 
isolation leads to an aggressive behaviour in Drosophila flies, whereas group 
housing suppresses the aggressiveness***", Therefore, we employed a GSEA 
algorithm to identify the enrichment of JAK/STAT-dependent transcriptional 
signatures in the publicly available brain transcriptomes of Drosophila popula- 
tions that were socially-induced or genetically-selected for low-aggressive (that 
is, social) behaviour (Supplementary Table 7). The transcriptomic profiles were 
derived from publicly available transcriptome data sets GSE5404 and GSE6994. 
We analysed statistics by evaluating nominal P value and NES on the basis of 1,000 
random gene set permutations. 

Promoter motif analysis. A GSEA algorithm was used to identify genes that 
are differentially expressed in brain transcriptomes of mice and rats exposed to 
social aggregation, domesticated zebrafish strains, and low-aggressive Drosophila 
melanogaster populations, compared with their control counterparts. High-scoring 
differentially expressed ‘leading-edge’ social genes were selected on the basis of 
their presence in the IFN-y and pathogen-induced transcriptional signatures. 
Specifically, as shown in Fig. 3, we have identified 31, 48, 14, and 53 leading-edge 
genes in brain transcriptomes of mice and rats exposed to social aggregation, 
domesticated Zebrafish strain, and low-aggressive Drosophila melanogaster 
population, respectively. We next extracted promoter sequences of 200 bp upstream 
of transcription start sites of these ‘leading-edge’ genes using UCSC Genome 
Browser (https://genome.ucsc.edu/). The MEME suite was then used to discover 
overrepresented transcription factor binding motifs, as described. MEME param- 
eters used were any number of motif repetitions per sequence, with a minimum 
motif width of 5 bases and maximum motif width of 15 bases. The discovered 
MEME motifs were compared using Tomtom analysis. In this case, the Tomtom 
motif similarity analysis ranks the MEME motif most similar to the vertebrates 
in vivo and in silico. The statistics were determined using Euclidean distance. 
Circos plot. A GSEA algorithm was applied to identify the enrichment of IFN-7, 
IL-4/IL-13, IL-17, and IL-10/TGF-8 signalling pathways in the brain transcrip- 
tomes of rodent animals exposed to social aggregation, stress, psychostimulants, 
antipsychotics, and antidepressants. All custom signatures were derived from 
publically available rodent transcriptomes downloaded from Gene Expression 
Omnibus. Statistical significance of GSEA results was assessed using 1,000 sample 
permutations. A NES greater than 1.5 and a nominal P value less than 0.05 was used 
to determined pairwise transcriptome connectivity. A Circos graph was generated 
using Circos package 0.68.12 (ref. 66). 
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Statistics. Data were analysed using the statistical methods stated in each figure 
legend. For the three-chamber assay, a two-way ANOVA was performed using 
genotype/treatment and sociability as main effects, followed by applying a Sidak’s 
post hoc comparison to assess if the group had a significant social preference. 
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Extended Data Figure 1 | SCID mice have no observable anxiety, 
motor, or olfactory deficits. a, The three-chamber sociability assay was 
used to test social behaviour. b, Neither wild-type nor SCID mice had 

a side bias in the habituation phase (empty cups) of the three-chamber 
assay (n= 6; repeated at least three times). c, There was no effect of 
genotype on distance travelled in the three-chamber assay during the 
habituation phase (n = 6; repeated at least three times). d, Both wild-type 
and SCID mice had an olfactory preference to urine, suggesting normal 
olfactory behaviour (m= 8 mice per group; ANOVA for urine preference 
Fi 3 = 31.01; P< 0.0001; ***P < 0.001, **P < 0.01, Sidak’s post hoc test; 
single experiment). e, Percentage time spent in the open arms of 
plus-maze (n= 22 mice per group; pooled two independent experiments). 
f, Number of entries into the open arms of the plus-maze (n = 22 mice 
per group; pooled two independent experiments). g, Total arm entries of 
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plus-maze (n = 22 mice per group; pooled two independent experiments). 
h, Percentage time spent in the centre of the open field (n = 22 mice 

per group; pooled two independent experiments). i, Total ambulatory 
distance in the open field (n = 22 mice per group; pooled two independent 
experiments). j, Latency to fall off the accelerating rotarod (n =8 mice 

per group; single experiment). k, SCID mice spent less time investigating 
each other than wild-type mice spent investigating each other when placed 
into a novel social environment (n = 5 mice per group; repeated-measures 
ANOVA for genotype Fi,21 = 5.708 *P < 0.05; single experiment). 

1, Repopulated SCID mice have similar numbers (m) and percentages 

(n) of meningeal T cells as wild-type mice (n = 4-5 mice per group; 
repeated at least three times). Cells were gated on singlets, live, CD45", 
and TCR. 
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Extended Data Figure 2 | Neuroanatomical structures analysed by 
rsfMRI. a, Regions of interests (ROIs) were generated using The Mouse 
Brain by Paxinos and Franklin as a reference. Representative slices 

were extracted from ref. 32. Abbreviations are as follows: FrA, frontal 
association cortex; PrL, prelimbic cortex; OrbC, orbital cortex; 

OB, olfactory bulb; MC, motor cortex; SocC, somatosensory cortex; 

Ins, insula; PirF, piriform cortex; CpU, caudate putamen; Acb, accumbens; 
ACC, anterior cingulate cortex; dHip, dorsal hippocampus; T, thalamus; 
Amyg, amygdala; EntC, entorhinal cortex; Hyp, hypothalamus; VisC, 
visual cortex; SupC, superior colliculus; PAG, periductal grey; DpMe, deep 
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mesencephalic nucleus; vHip, ventral hippocampus; SNR, substantia 
nigra; VTA, ventral tegmental area; CB, cerebellum; BS, brain stem. 

b, Connectivity of local PFC/insular nodes. Correlation thresholds were 
applied to visualize the strength of the connection. Connections that pass 
a high threshold are shown in red; connections that pass a lower threshold 
are shown in dashed grey. SCID mice have aberrant hyper-connectivity 

in the PFC (n = 8-9 mice per group; P < 0.05, Jennrich test; two pooled 
independent experiments). c, c-fos cells in the hippocampus (n = 9-10 
mice per group; single experiment). 
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Extended Data Figure 3 | Acute reduction of meningeal T cells with 
anti-VLA4. a, Anti- VLA4 depletes meningeal T cells. Meninges were 
dissected and single-cell suspensions were immunostained. T cells were 
gated on live, single, CD45*, TCR* events and counted by flow cytometry. 
b, Acute injection of anti-VLA4 reduced the amount of TCR* T cells in the 
meninges (n= 4 mice per group; *P< 0.01; repeated at least twice). 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


aE 
Ca wrens cover? 


a 
| MCOTWNE so 


Extended Data Figure 4 | Circos plot showing the connectivity of Th1 response and social aggregation. Labels are shown for the data sets analysed 
and presented in Fig. Lh. 
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and IFN-+-deficient mice have normal levels of anxiety and motor arms of the plus-maze (n = 20 mice per group; pooled two independent 
behaviour. a, A substantial percentage of meningeal T cells produce experiments). e, Percentage time spent in the centre of the open field 
IFN-¥. Cells were gated for live, singlets, CD45*, and TCR®. Ifng-/~ mice (n= 20 mice per group; pooled two independent experiments). f, Total 
were used to gate for IFN-7y staining. b, Percentage time spent in open ambulatory distance in the open field (n = 20 mice per group; pooled two 
arms of the plus-maze (1 = 20 mice per group; pooled two independent independent experiments). g, Latency to fall off the accelerating rotarod 
experiments). c, Entries into the open arms of plus-maze (n = 20 mice (n=8 mice per group; single experiment). 
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Extended Data Figure 6 | IFN-+ signalling is necessary for normal by IFN-7 (Jennerich test; P= 0.02; pooled two independent experiments). 
social behaviour. a, Repopulating SCID mice with wild-type lymphocytes __c, Ifngr1~/~ mice have social deficits (1 =6 mice per group; ANOVA for 
rescued a social preference; repopulating with Ifng~/~ lymphocytes did interaction P= 0.01; **P <0.01 Sidak’s post hoc test) that were not rescued 
not rescue a social preference; ANOVA for social behaviour Fj,14= 11.99; by injecting IFN-7 into the CSF (d; n = 5-6 mice per group; ANOVA for 
P=0.0038 (**P < 0.01; n=8 mice per group; single experiment). interaction P=0.01; **P < 0.01 Sidak’s post hoc test; single experiment). 
b, Connectivity of local PFC/insular nodes. Correlation thresholds were e, 114~/~ mice spend more time than wild-type mice investigating a novel 


applied to visualize the strength of the connection. Connections that pass mouse; ANOVA for genotype F132 = 5.397; P= 0.0267 (*P < 0.05 Sidak’s 
a high threshold are shown in red; connections that pass alower threshold _ post hoc test; n = 16-18 mice per group; pooled three independent 

are shown in dashed grey. Ifng-/~ mice have more connections than wild- experiments). 

type mice (Jennerich test; P= 0.0006). These connections were reduced 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


“ i | 8 
2 x . <= a 
@ Ol uy g $s 
F z it 1 = a pt z i roe core 
x i wou ow oto op 
FSc-A FSC-A, FSC-W 55-4 CD1I1B 


Extended Data Figure 7 | Gating strategy for neurons and microglia. Brain homogenates were stained and analysed by flow cytometry. Cells were 
gated on nucleated, singlets, and live. Neurons were then gated on NeuN-positive and microglia on CD11B-positive cells. 
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Extended Data Figure 8 | IFN-7+ signalling in microglia is not necessary 
for normal social function. Mice deficient for STAT1 in microglia 

have normal social preference (n =9 mice per group; ANOVA for 

Cre F\,16 = 1.809 and sociability F),156= 30.10; P< 0.0001; **P < 0.01; 
** P < 0.001 Sidak’s post hoc test; pooled two independent experiments). 
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Extended Data Figure 9 | Deleting IFNGR1 by AAV transduction. 

Mice were injected with AVVs expressing Cre and GFP under a synapsin 
promoter. a, GFP fluorescence in the PFC. Atlas image adapted from 

2015 Allen Institute for Brain Science, Allen Brain Atlas. Available from: 
http://www.brain-map.org. b, GFP fluorescence is only observed in NeuN* 
neurons, not Iba* microglia (top, 20x; bottom 63 x objective). 
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the cisterna magna) 2h before killing and processing brains for and somatosensory cortex (d; n =6 neurons from three mice). 
immunohistochemistry. Slices were stained for c-fos. Atlas image adapted _c, Vgat*"*::Ifugr" mice. IFN-y increased tonic inhibition in Cre~ mice 
from 2015 Allen Institute for Brain Science, Allen Brain Atlas. Available (n= 6-7 cells from four mice per group; **P < 0.01 Sidak’s post hoc test). 


from: http://www.brain-map.org. b, Total c-fos* cells in layer I of the 
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Identification of proliferative and mature B-cells in 


the islets of Langerhans 


Erik Bader!***, Adriana Migliorini>**, Moritz Gegg!*, Noah Moruzzi!“, Jantje Gerdes!, Sara S. Roscioni!, Mostafa Bakhti', 
Elisabeth Brandl', Martin Irmler*.°, Johannes Beckers®®’, Michaela Aichler®, Annette Feuchtinger®, Christin Leitzinger’, 
Hans Zischka’, Rui Wang-Sattler’, Martin Jastroch>!, Matthias Tschép>!°, Fausto Machicao*!, Harald Staiger®), 
Hans-Ulrich Haring>!”, Helena Chmelova>!*“, Julie A. Chouinard>!3-4, Nikolay Oskolkov!’, Olle Korsgren!®, 


Stephan Speier®!*4 & Heiko Lickert>?> 


Insulin-dependent diabetes is a complex multifactorial disorder 
characterized by loss or dysfunction of 3-cells. Pancreatic 3-cells 
differ in size, glucose responsiveness, insulin secretion and precursor 
cell potential); understanding the mechanisms that underlie this 
functional heterogeneity might make it possible to develop new 
regenerative approaches. Here we show that Fltp (also known as 
Flattop and Cfap126), a Wnt/planar cell polarity (PCP) effector and 
reporter gene®, acts as a marker gene that subdivides endocrine cells 
into two subpopulations and distinguishes proliferation-competent 
from mature 8-cells with distinct molecular, physiological and 
ultrastructural features. Genetic lineage tracing revealed that 
endocrine subpopulations from Fltp-negative and -positive lineages 
react differently to physiological and pathological changes. The 
expression of Fltp increases when endocrine cells cluster together 
to form polarized and mature 3D islet mini-organs’ °. We show that 
3D architecture and Wnt/PCP ligands are sufficient to trigger B-cell 
maturation. By contrast, the Wnt/PCP effector Fltp is not necessary 
for 3-cell development, proliferation or maturation. We conclude 
that 3D architecture and Wnt/PCP signalling underlie functional 
(3-cell heterogeneity and induce 3-cell maturation. The identification 
of Fltp as a marker for endocrine subpopulations sheds light on the 
molecular underpinnings of islet cell heterogeneity and plasticity 
and might enable targeting of endocrine subpopulations for the 
regeneration of functional 3-cell mass in diabetic patients. 

The islet of Langerhans is a complex mini-organ that regulates energy 
metabolism’®. Pancreatic 3-cells are organized in polarized rosette-like 
structures around blood vessels’ and are coupled by gap junctions to 
allow them to sense glucose and secrete appropriate amounts of insu- 
lin into the blood stream!. The conserved Wnt/planar cell polarity 
(PCP) pathway regulates the orientation of cells and organelles within 
the plane of a tissue and thus determines the function of cells in an 
organ'', As self-organized 3D tissue architecture is required for 
organ formation and terminal differentiation!*!>, we tested whether 
acquisition of tissue polarity and architecture could influence (-cell 
heterogeneity and maturation. 

To analyse the establishment of tissue polarity at the molecular level, 
we used the Fitp~” knock-in/knockout mouse, in which the entire 
open-reading frame of Fitp is replaced by a multicistronic lacZ-Venus 
reporter cassette®. In this model, Fltp promoter-driven histone 2B 


(H2B)-Venus reporter (FVR) activity is confined to terminally differ- 
entiated cells with established PCP®. We first analysed reporter activity 
in the pancreas of Fitp“”/* heterozygous embryos at embryonic day 
(E)18.5, when endocrine cells are still organized in cord-like struc- 
tures outside the ductal tubules and start to aggregate and form 3D 
mini-organs (Extended Data Fig. 1). Notably, we detected heterogene- 
ous reporter activity in 3-cells that expressed high levels of Nkx6.1 in 
compacted islet structures, but not in scattered pre-B-cells in cord-like 
structures that expressed low levels of Nkx6.1. This finding suggests 
that FVR is activated in endocrine cells during islet compaction and 
establishment of polarization. Because (B-cell maturation occurs in the 
first two weeks after birth?, we analysed reporter activity in postnatal 
and adult islets from heterozygous Fitp2"'* mice. At postnatal day (P)1, 
reporter activity was detectable in less than 50% of Nkx6.17 B-cells; 
activity was significantly increased at P11 and reached a maximum 
of 80% in adult islets (Fig. 1a, b). In adult islets, FVR was expressed 
in ~50% of glucagon-secreting a-cells, ~80% of insulin-secreting 
Nkx6.1* B-cells, ~50% of somatostatin-secreting 6-cells and ~50% of 
pancreatic polypeptide-secreting PP-cells (Fig. 1c, d). Together, these 
observations suggest that islet cell heterogeneity develops during islet 
neogenesis and is likely to be determined by the terminal differentiation 
and planar polarization status of endocrine cells. 

To understand the biological relevance of islet cell heterogeneity in 
more detail, we specifically focused on 3-cells in heterozygous Fltp2”/* 
mice. We first compared the proliferation rate of FVR~ and FVRt 
B-cells during homeostasis, postnatal (3-cell expansion and physiolog- 
ical insulin resistance. FVR” Nkx6.1* (-cells showed a significantly 
increased replication rate during the postnatal expansion phase at P1, 
at P11 and upon metabolic demand during pregnancy at gestational 
day (G)15.5, as measured by Ki-67 immunoreactivity (Fig. le, f) and 
EdU incorporation (Extended Data Fig. 2a, b), when compared to 
FVRt Nkx6.1* B-cells; by contrast, FVRt Nkx6.1+ B-cells showed 
increased levels of the cell-cycle inhibitor p27 (Extended Data 
Fig. 2h-j). When we compared the head and tail regions and different- 
sized islets, we noticed remarkable intra-islet and inter-islet heterogene- 
ity in terms of FVR marker expression and proliferation rate (Extended 
Data Fig. 2c-g), probably due to different cell-type compositions in 
the islet niches’®. Collectively, our data show that (}-cell subpopula- 
tions, depending on their niche and environmental conditions, show 
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Figure 1 | FVR expression increases during postnatal 6-cell maturation, 
is heterogeneous in endocrine cell lineages and is absent in B-cells with 
high proliferative capacity. a, b, Laser scanning confocal microscopy 
(LSM) images (a) and quantification (b) of FVR* 6-cell distribution 
during postnatal }-cell maturation (P1, P11, P25) and in adulthood 

(12 weeks (w)) (P1: n (mice) = 5 (14,766 Nkx6.1* cells), P11: (mice) = 6 
(13,789 Nkx6.1* cells), P25: n (mice) =5 (8,733 Nkx6.1* cells), 12 weeks: 
n (mice) = 9 (17,369 Nkx6.1* cells); ****P < 0.0001). c, d, LSM images 
(c) and quantification (d) of FVR expression in endocrine cell lineages 

in adulthood (n (mice) = 3, 1,676 PP* cells, 1,656 Sst* (8-)cells and 

1,975 Gcgt (a-)cells); n (mice for 3-cells) = 5, 3-cells (7,327 Nkx6.1* 
cells)). e, f, LSM images (e) and quantification (f) of Ki-67-expressing 
cells among FVR™ and FVR* (-cell subpopulations at P1, P11, 12 weeks 
and in pregnant mice (G15.5; arrowheads indicate proliferating cells; 

red arrowheads, Nkx6.1; green arrowheads, Venus and Nkx6.1; Pl: n 
(mice) =5 (14,766 Nkx6.1* cells), P11: n (mice) = 6 (13,789 Nkx6.1* 
cells), 12 weeks: n (mice) = 5 (10,042 Nkx6.1* cells) and pregnant 

G15.5: n (mice) =5 (12,691 Nkx6.1* cells); (P1) **P=0.0048, (P11) 
#**EED < ().0001, (G15.5) **P= 0.0039). Scale bars, 501m (a, c, e). Two- 
sided unpaired t-test was performed (b, e). Data represent mean +s.e.m. 
Source data for all graphs and quantifications are provided online. 


markedly different FVR expression levels correlating with their pro- 
liferative capacity. 

The data presented so far suggest that the expression of FVR probably 
subdivides 3-cells into proliferative and more mature (}-cells. To test this 
idea directly, we isolated adult islets from Fltp“”'* mice and purified the 
FVR endocrine subpopulations for genome-wide mRNA expression 
profiling (Fig. 2a, b). We found 997 genes that showed more than 1.5- 
fold differences in expression between FVR™ and FVR* endocrine cells 
(Fig. 2c and Supplementary Table 1). Strikingly, unbiased gene ontol- 
ogy term (Fig. 2g and Supplementary Table 2) and ingenuity analysis 
(Fig. 2h and Supplementary Table 3) revealed that the FVR™ subpopula- 
tion showed significant enrichment of genes associated with G-protein 
coupled receptor (GPCR), Wnt and MAPK signalling transduction, 
whereas the FVR* subpopulation showed significant enrichment of 
genes that are important for mature 6-cell function. This difference is 
reflected by differential mRNA expression of hormones, }-cell mat- 
uration markers, mitochondrial and metabolic genes, receptors and 
signalling components, as revealed by microarray analysis (Fig. 2d—f) 
and confirmed by qPCR analysis on highly enriched (-cell subpop- 
ulations (Extended Data Fig. 3a—h). Mitochondrial gene expression 
(Fig. 2e, g, h), ultrastructural (Extended Data Fig. 4) and biochemical 
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analysis (Fig. 2k, 1), and mitochondrial network architecture (Extended 
Data Fig. 5e-k) collectively suggest that the two isolated 3-cell sub- 
populations have similar numbers of mitochondria (Extended 
Data Fig. 3i) but that the mitochondria differ in architecture and 
physiology. Measuring the oxygen consumption rate of sorted FVR™ 
and FVR®* (-cells from Fitp“/* islets using an XF96 extracellular 
flux analyser revealed that glucose-induced ATP synthesis and glu- 
cose responsiveness were not significantly different between the two 
subpopulations (Extended Data Fig. 5a—d), whereas static glucose- 
stimulated and arginine-stimulated insulin secretion were significantly 
enhanced in the sorted and reaggregated FVR™ (3-cell enriched sub- 
population (Extended Data Fig. 51). These results are consistent with 
the differences in the expression of genes involved in glucose sensing 
and insulin secretion (Fig. 2 and Extended Data Fig. 3g). Finally, the 
high expression of the proprotein convertase subtilisin/kexin type-1 
gene (Pcsk1) in the FVR* cells (Extended Data Fig. 3g) and the higher 
number of mature secretory granules in Fltp-lineage* cells (Fig. 2i, j 
and see below) further support the idea that FVR* and Fltp-lineage* 
cells are mature }-cells. 

The differential expression of signalling components and the dis- 
tinct physiologies of the two B-cell subpopulations suggested that they 
might react differently to environmental changes. To test this hypoth- 
esis, we used a Cre recombinase/loxP-mediated genetic lineage tracing 
approach, which allowed us to follow the cell populations over time 
(Fig. 3a). We isolated Fltp?Aicre’ +*;Gt(ROSA)26"""S islets (refs 17, 18) 
and performed in vitro single cell-tracking experiments. This showed 
that Fltp-lineage~ endocrine cells turned into Fltp-lineage* cells 
(Fig. 3b). As FVR™ and FVR* (3-cells are molecularly and biologically 
distinct, these results suggest that FVR™ precursor (3-cells differentiate 
into FVR* mature (-cells. Notably, the FVR™ (-cell enriched subpop- 
ulation expressed low levels of Slc2a2 (Extended Data Fig. 3g), which 
has been recently shown to be expressed in insulin-positive pancreatic 
multipotent precursor cells in mouse and human islets** 

To analyse the in vivo 8-cell subpopulation dynamics at the time 
when compensatory {3-cell proliferation reaches peak levels!”, we iso- 
lated adult islets from non-pregnant female mice and from pregnant 
female mice at G15.5 (Fig. 3c, d). Quantitative analysis showed that the 
Fltp-lineage™ 6-cell subpopulation compensates for the increased met- 
abolic demand in pregnant mice by proliferating. The higher number 
of mImG double-positive cells in pregnant mice further suggested that 
Fltp-lineage™ cells differentiate into mature (-cells in vivo. Next, we 
transplanted Flip T2aicre/ *;Gt(ROSA)26""" islets into the anterior cham- 
ber of the eye in mice for noninvasive longitudinal in vivo imaging”. 
After donor islet isolation and transplantation onto the iris, islets first 
shrank in volume during engraftment, owing to the lack of nutritional 
supply, before revascularization counteracted islet degeneration 
(Extended Data Fig. 6d). Concomitant with the revascularization and 
decrease in islet size we observed a relative increase in the Fltp-lineage™ 
and decrease in the Fltp-lineaget subpopulation volume (Fig. 3f, g) and 
number (Extended Data Fig. 6e) 13 days after transplantation, suggest- 
ing that the more mature and metabolic active endocrine subpopulation 
undergoes apoptosis, probably due to hypoxia, until revascularization. 
Four weeks after transplantation the Fltp-lineage* subpopulation had 
recovered, probably owing to the maturation of Fltp-lineage™ cells into 
mature Fltp-lineage* cells (Fig. 3e). Finally, we investigated the endo- 
crine subpopulation dynamics in a high-fat diet (HFD) mouse model. 
HFD causes insulin resistance and compensatory 8-cell hyperplasia 
followed by hypertrophy, but the exact pathomechanism of 3-cell failure 
remains unclear!. Over an eight-week period, we observed increased 
glucose intolerance (Extended Data Fig. 6c) and a steady increase 
in islet volume accompanied by a significant increase in the cross- 
sectional area of the Fltp-lineage* endocrine cell population, indi- 
cating that mature endocrine cells are more metabolically active and 
prone to hypertrophy commonly associated with cytotoxic stress 
(Fig. 3h-k and Extended Data Fig. 6a, b). This observation might 
explain why the Fltp-lineage* subpopulation predominantly decreased 
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Figure 2 | FVR distinguishes between two endocrine subpopulations 
with distinct molecular, physiological and ultrastructural features. 

a, Experimental design for microarray analysis of endocrine 
subpopulations from isolated Fitp2”’* islets. b, Fluorescent activated cell 
sorting (FACS) separation of FVR* and FVR™ endocrine subpopulations. 
c, Scatter plot of the 997 genes that were differentially regulated in FVRt 
and FVR’ endocrine cells (1.5-fold regulated, P < 0.01, limma t-test). 

d-f, Heatmaps depicting regulated genes among FVR subpopulations 
involved in 6-cell function (d), mitochondrial function (e) and receptors 
and signalling (f) (*validated by qPCR, median centre was applied to each 
gene of the heatmaps across different samples). g, h, Bar graph of selected, 
significantly enriched gene ontology terms (g) and ingenuity pathways (h) 


after transplantation (Fig. 3g). Together, our results suggest that, in 
pregnant mice, Fltp~ endocrine cells undergo compensatory proliferation, 
whereas more mature Fltp-lineage* endocrine cells probably account for 
islet cell hypertrophy and seem more sensitive to cytotoxic stress. 

In our molecular analysis we noticed that apical—basal polarity and 
expression of Wnt/PCP genes were increased in FVR* 3-cells (Fig. 2 and 
Extended Data Fig. 3g, j). Notably, cellular polarization and compaction 
in 3D pseudo-islet cultures”? strongly induced protein expression of a 
variety of 3-cell maturation markers, including Nkx6.1, MafA and Ucn3, 
as revealed by quantitative PCR (qPCR) and immunocytochemistry 
in time-course experiments in Min6 insulinoma cells (Extended Data 
Fig. 7a-j). Consistent with the induction of the Wnt/PCP effector Fltp 
and Nkx6.1 during islet neogenesis (Extended Data Fig. 1), we observed 
that treatment of dispersed and reaggregated P5 islet cells (Fig. 4a—c) 
and pseudo-islets of Min6 insulinoma cells (Extended Data Fig. 7k-n) 
with the Wnt/PCP ligand Wnt5a significantly upregulated 3-cell 
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of regulated genes among FVR subpopulations shown in c. 

i, j, Transmission electron microscopy images (i; scale bars, 1 nm) and 
quantification (j) of insulin granules in immunogold-labelled 3-cells 
from Fitp'?4'C’+;Gt(ROSA)26™"/* mice (m, mature; im, immature) (n 
(independent experiments) = 5, n (B-cells) = 50, n (granules) = 12,254; 
** P = 0.0015, two sided unpaired t-test). k, 1, Western blot (k) and 
quantification (1) of mitochondria complexes I-V from FACS-sorted 
FVR* and FVR™ subpopulations (protein expression normalized to 
-tubulin and to the FVR™ population (dotted line), n (independent 
experiments) = 5, whiskers represent minimum and maximum values). 
Data in j represent mean + s.e.m. See Supplementary Fig. 1 for gel source 
data. 


maturation makers. Human microislets and human islets show heter- 
ogenous expression of WNT4 in endocrine and -cells, consistent with 
the differential expression of Wnt4 and Wnt5b in the FVR- and FVR* 
subpopulations (Fig. 4d-g and Extended Data Fig. 3g). Treatment of 
human EndoC-§ H1 (3-cells with the canonical Wnt ligand WNT3A 
increased proliferation, whereas treatment with WNT4 or WNT5A did 
not (Extended Data Fig. 8a-e). Notably, WNT4 induced JNK activation 
(Extended Data Fig. 8f, g) and increased levels of NKX6.1 and PDX1 
protein in human EndoC- H1 (3-cells (Fig. 4h, i) and human islets 
(Fig. 4j, k), suggesting that WNT4 triggers the non-canonical Wnt/ 
PCP pathway. Finally, WNT5A increased glucose-stimulated insu- 
lin secretion (GSIS) in human microislets (Extended Data Fig. 8h). 
Together, these results suggest that 3D architecture and Wnt/PCP path- 
way activation induce 3-cell maturation and increase GSIS. 

Finally, we investigated whether the Wnt/PCP effector Fltp is 
required for 3-cell function beyond its role as a maturation marker. 
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Figure 3 | Fltp genetic lineage tracing reveals differential responses 

of endocrine and 8-cell subpopulations upon physiological and 
pathological insulin resistance. a, Fltp’“'""/*;Gt(ROSA)26™""/+ 
lineage-tracing model. Fltp lineage™ cells (mT, red) convert into Fltp 
lineage* cells (mG, green) upon Fltp-promoter driven Cre expression via 
an intermediate (mTmG, yellow) cell state. b, e, Time-lapse imaging of 

in vitro cultured Fltp??“'"’*;Gt(ROSA)26"'"“’* islets (b; n (cells) =7) 
and in vivo islets transplanted into the anterior chamber of the eye (AC) 
using laser scanning confocal microscopy (LSM) (e) shows conversion 
of endocrine Fltp lineage” cells (mT) into Fltp lineage* cells (mG) 
(white dotted line shows converting cells). c, d, FACS analysis (c) and 
quantification (d) of mT, mImG and mG Nkx6.1* 6-cells from control 
(Ctrl) and pregnant mice (G15.5) (n (mice per group) = 4, ***P=0.0001, 
***P — 0.0003, Bonferroni's multiple comparison test). f, g, LSM images 
(f) and quantification (g) of a Fitp’?4/"/*;Gt(ROSA)26™"“/* islet 
transplanted into the AC over engraftment time (n (islets) = 15 islets 
from 4 mice; **P = 0.0086, two sided unpaired t-test). h, i, LSM images 
of a Fltp’?4'"'’’*;Gt(ROSA)26™™°'* islet in the AC before (Pre-diet; 

h) and after 4 weeks on a high-fat diet (HFD; i). j, k, Quantification 

of mT and mG cell cross-sectional areas (j) and compartment 

volume change among mT and mG endocrine cells (k) of intraocular 
Fitp'4'C/+,Gt(ROSA)26"™"/* islets before and during HED (4 w, 8 w) 
(j, n (islets) = 34, *P = 0.0112, **P = 0.00732; k, n (islets) = 34, *P = 0.026, 
**** P< 0.0001, Sidak’s multiple comparison test). Scale bars, 10 1m (b); 
20 um (e, f); 50,1m (h, i). Data shown as mean + s.e.m. 


Terminally differentiated cell types in the inner ear and lung depend 
on Fltp function as a modulator of the actin and microtubule 
cytoskeleton® and mature $-cells require cell polarity”* and the 
cytoskeleton”® for proper insulin secretion. Fltp knockout (Fltp2"’7") 
B-cells do not show impaired development, proliferation or maturation 
(Extended Data Figs 9, 10a, c, d) or altered glucose tolerance or insulin 
sensitivity (Extended Data Fig. 10b, e). However, when we analysed 
basal glucose and insulin levels in adult mice we found small but sig- 
nificantly reduced levels in starved Fitp*"/“" mice, suggesting that 
Fltp knockout causes a minor defect in insulin secretion (Extended 
Data Fig. 10b, f). To rule out systemic effects on basal insulin levels in 
Ftp?” ZV animals, we isolated islets from adult mice and performed GSIS. 
Consistent with the finding that insulin secretion depends on cell polar- 
ity and the cytoskeleton”*”> we found reduced GSIS in Fitp*"’" islets 
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Figure 4 | The Wnt/PCP pathway triggers 3-cell maturation. a, b, LSM 
images (a) and quantification (b) of Ucn3 protein in reaggregated islet cells 
from P5 wild-type mice treated with or without Wnt5a (n (independent 
experiments) = 3 (DAPI* cells control = 680, DAPI* cells Wnt5a = 877), 
* P = 0.001, two sided unpaired t-test). c, Wnt5a stimulation of mRNA 
expression of }-cell maturation markers in P5 reaggregated islet cells 

(n (independent experiments) = 4, *P < 0.05, two sided unpaired f-test). 
d-g, LSM images of a human islet (d, e) and a human microislet (f, g) 
showing heterogeneity of WNT4-expressing NKX6.1* 8-cells (white 
dotted lines and arrowheads). h, i, Western blot (h) and quantification (i) 
of NKX6.1 and PDX] in EndoC-3 H1 8-cells after 4 days of treatment with 
WNT4 and WNT5SA (n (independent experiments) = 5; *P = 0.030 for 
PDX1, *P=0.0349 for NKX6.1, two-sided paired t-test). j, k, Western 
blot (j) and quantification (k) of PDX1 expression in human islet after 

4 days of treatment with WNT4 (n (independent experiments) = 9, 4 
human donors, two-sided paired t-test). 1, m, RNA sequencing expression 
data of FLTP and PDX1 in human pancreatic islets from healthy donors 
(normal glucose tolerance (NGT), n = 66), pre-diabetic donors (impaired 
glucose tolerance (IGT), n = 21) and type 2 diabetic donors (T2D, 

n= 19; one-way ANOVA). Scale bars, 101m (a, d-g). Data represent 
mean + s.e.m. See Supplementary Fig. 1 for gel source data. 


(Extended Data Fig. 10g). To investigate whether the human FLTP 
orthologue (CFAP126, also known as Clorf192) is associated with 
metabolic traits, we genotyped seven single nucleotide polymor- 
phisms (SNPs) tagging all the common variation in this gene (minor 
allele frequency > 0.01) and analysed their associations with insulin 
secretion, insulin sensitivity and body fat content and distribution in 
a cohort of 2,228 human prediabetic subjects”®. The results showed 
that the intronic SNP rs75715534 (minor allele frequency 0.037) was 
significantly associated with increased insulin secretion in lean subjects 
(body mass index (BMI) < 25), whereas the same SNP was associated 
with decreased insulin secretion in obese people (BMI > 35) (Extended 
Data Fig. 8j, k and Supplementary Table 4). CFAP126, SLC2A2 and 
PDX1 were significantly downregulated in isolated human islets from 
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pre-diabetic (IGT) when compared to non-diabetic (NGT) subjects and 
further downregulated in islets from type 2 diabetic (T2D) individuals 
(Fig. 41, m and Extended Data Fig. 8i), suggesting that Wnt/PCP signal- 
ling and {-cell maturation or function are impaired by glucolipotoxicity. 

We have demonstrated that proliferative and mature }-cells can be 
distinguished by expression of the Wnt/PCP effector and reporter gene 
Fitp. Several studies have indicated that loss of 3-cell identity and mat- 
uration underlies the pathogenesis of type 2 diabetes”~*°. The availa- 
bility of a unique marker for mature 3-cells opens new ways to explore 
pathways regulating islet heterogeneity and might allow researchers to 
target B-cell maturation for cell-replacement and regenerative therapy. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Animal studies. Animal experiments were carried out in compliance with 
the German Animal Protection Act and with the approved guidelines of 
the Society of Laboratory Animals (GV-SOLAS) and of the Federation of 
Laboratory Animal Science Associations (FELASA). Mouse lines used: Fitp7Y 
(C57BL/6J; ref. 6) and Fltp’4'/+ (mixed C57BL/6J, CD1 background; ref. 17) 
crossed with Gt(ROSA)26"!" (129/SvJ, C57BL/6] mixed background; ref. 18) 
(Fltp??4'"'’+Gt(ROSA)26""°). Albino B6 mice (B6N-Tyre/BrdCrCrl; Charles 
River) crossed with B6 Ragl~/~ mice (B6.129S7-Ragltm1Mom/J; Jackson 
Laboratories) were used as recipients for Bip ner +*;Gt(ROSA)26""G islet 
transplants. All experiments were performed using 3-6-month-old mice, unless 
indicated otherwise. Islet isolation was carried out by collagenase P (Roche) diges- 
tion and centrifugation using Optiprep density gradient (Sigma) as described". 
Isolated islets were handpicked twice under a microscope. For analysis of islet 5-cell 
proliferation during pregnancy, mice were euthanized at G15.5 and for histological 
analysis EdU was administrated 24h before sacrifice. For HFD analysis, fourteen 
weeks after islet transplantation, mice were divided into two groups and either 
continued on a normal diet (Ssniff GmbH) or switched to HFD (60 kcal % fat; 
Research Diets, Inc.) with ad libitum access to water and chow. 

For intraperitoneal glucose tolerance tests (IPGTTs) and insulin tolerance test 

(ITT) mice were fasted for 6h and injected intraperitoneally with 2 g glucose per 
kg (BW) or 0.75 U insulin per kg (BW), respectively. Blood glucose was measured 
at indicated time points after glucose injection using Accu-Chek Aviva glucometer 
(Roche). 
Cell culture and human islets. Min6 (clone 9) murine 3-cell line and EndoC-$ H1 
were cultured in adherence (2D culture) as described previously****. For floating 
3D (3D) culture Min6 were plated on uncoated ibidi chambers (ibidi) for 3, 6 or 
7 days. For 3D matrigel based cultures, Min6 and EndoC-3 H1 were cultured in 
Growth-Factor-Reduced Matrigel Matrix (BD Bioscience, Germany) diluted 1:1 
in their respective medium. All the cell lines used were tested for mycoplasma 
and were negative. Isolated islet cells from P5 pancreata were cultured overnight 
in RPMI 1640 containing P/S and 5% FCS. For analysis of Wnt5a induced {-cell 
maturation, samples were stimulated with 400 ng/ml of Wnt5a (R&D systems, 
Germany) for 12h, 3 or 6 days in RPMI1640 supplemented with P/S and 3% FCS. 
Human islets isolated from cadaver and pancreatic microislets were obtained 
from the JDRF award 31-2008-416 (ECIT Islet for Basic Research program) and 
from InSphero (Switzerland), respectively. All experiments with human islets 
were approved by the Ethical commission of the Technical University of Munich 
(Germany). Human islet donor data: 6 donors, 3 males and 3 females; age mean 
53.6 yrs (range: 41-65); BMI, mean 24.25 (range: 20.8-30.1); purity mean 78 (range 
89-45%). Human microislet donor data: 3 donors, 2 female and 1 male; age mean 
40.10 yrs (range: 37-56); BMI mean 23.19 (range: 22.7-23.56). 

Upon arrival, human islets were collected by centrifugation, selected by hand 
picking and cultured in CMRL1066 (Gibco, Thermo Fisher Scientific) sup- 
plemented with 10% human serum (Sigma), 1% P/S and 2mM 1-glutamine. 
Pancreatic microislets were cultured according to the manufacturer's instruction. 
Human islets were treated with 100 ng/ml WNT4 (R&D systems) or 400 ng/ml 
WNT5A in CMRL 1066 supplemented with 2% human serum albumin (Sigma), 
1% P/S, 2mM L-glutamine, 5.5mM glucose, 100 ng/ml mouse Noggin (Peprotech) 
and 10,.M Y-27632 (Santa Cruz) for 4 days (medium was changed every 2 days). 

Microislets were treated with 100 ng/ml WNT4 (R&D Systems) or 400 ng/ml 
WNT5A in CMRL 1066 supplemented with 2% human serum albumin, 1% P/S, 
2mM 1-glutamine, 5.5 mM glucose and 100 ng/ml mouse Noggin (Peprotech) for 
4 days (medium was changed every 2 days). 

EndoC-( H1 cells were treated with 100ng/ml WNT4 (R&D systems) or 400 ng/ml 
WNT5A or 200ng/ml WNT3A (Peprotech) in their culture medium? supple- 
mented with 100 ng/ml mouse Noggin (Peprotech) and 10|1M Y-27632 (Santa 
Cruz) for 4h or 4 days (medium was changed every 2 days). 

Islet transplantation and in vivo imaging of islets engrafted in the anterior 
chamber of the eye. For islet transplantation into the anterior chamber of the 
mouse eye, mice were anaesthetized by inhalation of 2% isoflurane in 100% oxygen 
via a face mask. The mouse head was fixed in a head holder and a 25-gauge needle 
used to make a small incision into the cornea, close to the corneal limbus. Next, 
30-40 islets in PBS were slowly injected into the anterior chamber of the eye using 
a custom made beveled glass cannula (outer diameter 0.4mm, inner diameter 
0.32 mm; Hilgenberg GmbH). For in vivo imaging mice were intubated (BioLite, 
Braintree Scientific, Inc.) and anaesthetized by 2% isoflurane in 100% oxygen with 
27011 stroke volume at 250 strokes per min for <90 min. In vivo imaging was 
performed as previously described*. For confocal and two-photon imaging, an 
upright laser-scanning microscope (LSM780 NLO; Zeiss) with a two-photon laser 
(Chameleon Vision II; Coherent, Inc.) and W Plan-Apochromat 20 x/1.0 DIC M27 
75mm objective (Zeiss) was used. Imaging of intraocular islets was performed 
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during islet engraftment (days 3, 13 and 4 weeks post-transplantation) and repeated 
four and eight weeks after diet switch. Backscattered laser light was detected at 
633 nm. Total islet volume was calculated using surface rendering (Imaris 7.4, 
Bitplane AG). GFP (mG) and Tomato (mT) were excited by two-photon laser at 
930nm and detected at 500-550 nm and at 575-610 nm, respectively. mG and mT 
islet fractions were assessed from Gaussian filtered z-stacks (step size of 1.5|1m) 
of the whole islet for the revascularization or within the top 801m of the islet for 
the HED time points using surface rendering (Imaris 7.4). Compartment volumes 
were calculated in relation to total islet volume obtained from backscatter imaging. 
Vessels were visualized by injecting 0.8 1M Qtracker 705 (Life Technologies) in 
100.1 PBS into the tail vein and detected at 690-730 nm. mG and mT cell cross- 
sectional areas were assessed manually in individual optical planes of the islet 
z-stacks (Fiji software)”. 

Transmission electron microscopy. Immunogold staining was performed on 
isolated islets from Fltp’4“'“"’*;Gt(ROSA)26"" “+ mice as follows. After islet 
isolation the islets were cultured overnight in RPMI1640 supplemented with 
10% FCS and P/S, then fixed in 2.5% electron microscopy grade glutaralde- 
hyde in 0.1 M sodium cacodylate buffer pH 7.4 (Science Services), stained with 
rabbit anti-GFP (Invitrogen, A11122, 1:500) and a gold-conjugated goat anti- 
rabbit antibody (Aurion, 806.011, 1:40). Isolated FACS-sorted FVR islet cells or 
immunogold-stained islets were fixed in 2.5% electron microscopy grade glutar- 
aldehyde in 0.1 M sodium cacodylate buffer pH 7.4 (Science Services) as pellets, 
post-fixed in 2% aqueous osmium tetraoxide*®, dehydrated in gradual ethanol 
(30-100%) and propylene oxide, embedded in Epon (Merck) and cured for 48h 
at 60°C. Semi-thin sections were cut and stained with toluidine blue. Ultrathin 
sections of 50 nm were collected onto 200 mesh copper grids, stained with uranyl 
acetate and lead citrate before examination by transmission electron microscopy 
(Zeiss Libra 120 Plus, Carl Zeiss NTS GmbH). Pictures were acquired using a Slow 
Scan CCD-camera and iTEM software (Olympus Soft Imaging Solutions). Serial 
pictures for image analyses were acquired by transmission electron microscopy 
(Zeiss EM10CR, Carl Zeiss NTS GmbH) with a MegaView III camera system 
(Olympus Deutschland GmbH). Images were quantified using image analysis 
software Definiens Developer XD2 (Definiens AG) according to a previously 
published procedure”. Initially mitochondria were annotated manually. A rule 
set was developed in order to detect and quantify the length of the inner mitochon- 
drial membrane within these marked mitochondria, based on intensity, morphol- 
ogy and neighbourhood. Additionally the area for each mitochondrion and the 
number of mitochondria per cell area were calculated. The B-cells were identified 
as Fltp lineage* or Fltp lineage” according to their immuno-gold labelling. Insulin 
granules were counted after their classification as immature or mature granules 
according to their morphology. 

Immunofluorescence staining and western blot. For immunohistochemistry, 
pancreas samples were fixed in 4% formalin, cryoprotected by incubation in a 
progressive sucrose gradient and embedded in Optimum Cutting Temperature 
(OCT). Sorted cells, Min6 and EndoC-$ H1 were fixed in 4% formalin before 
staining. For whole-mount staining, embryos were cleared using BABB solution 
and stained using a modified previously described protocol’. 

Primary antibodies used for immunostaining: goat anti-Nkx6.1 (R&D 
system, AF5857, 1:400), rabbit anti-Nkx6.1 (Acris/Novus, NBP1-82553, 1:100), 
rat anti-RFP (Chromotek, ORD003515, 1:1000), chicken anti-GFP (Aves Labs, 
GFP-1020, 1:800), guinea pig anti-glucagon (Millipore, 4031-01F, 1:500), goat 
anti-somatostatin (Santa Cruz, sc-7819, 1:300), rabbit anti-insulin (Thermo 
Scientific, PA-18001, 1:200), guinea pig anti-insulin (Thermo Scientific, PA-26938, 
1:300), goat anti-pancreatic polypeptide (PP) (Abcam, ab77192, 1:300), rabbit 
anti-Ki-67 (Abcam, ab15580, 1:200), rabbit anti-MafA (Bethyl, IHC-00352- 
1, 1:100), mouse anti-3-catenin (BD, 610154, 1:2000), rabbit anti-Urocortin 3 
(Phoenix Pharmaceuticals, H-019-29, 1:300), rabbit anti-p27 (Santa Cruz, sc-528, 
1:100), goat anti- Wnt4 (R&D Systems, AF475, 1:100), Alexa Fluor 546 phalloidin 
(Invitrogen, A22283, 1:200), rat anti-E-cadherin (raised by E. Kremmer (Helmholtz 
Zentrum Miinchen), 1:200), MitoTracker Deep Red FM (Life Technologies, 
M22426, final concentration in the medium: 200 nM). Secondary antibodies used 
for indirect fluorescence staining (dilution always 1:800): goat anti-chicken Alexa 
Fluor 488 (Dianova, 103-545-155), donkey anti-goat Alexa Fluor 488 (Invitrogen, 
A11055), donkey anti-mouse Alexa Fluor 555 (Invitrogen, A31570), donkey anti- 
goat Alexa Fluor 555 (Invitrogen, A21432), donkey anti-rabbit Alexa Fluor 555 
(Invitrogen, A31572), donkey anti-goat Alexa Fluor 594 (Invitrogen, A11058), 
donkey anti-mouse Alexa Fluor 594 (Invitrogen, A21203), donkey anti-guinea-pig 
Alexa Fluor 649 (Dianova, 706-495-148). Nuclear staining was performed with 
DAPI (Life Technology, 1:800). EdU staining was performed using the Click-iT 
Staining Kit (Life Technology) according to the manufacturer’s instructions. For 
analysis of proliferation of EndoC-( H1, cells were pulsed with 10,1M EdU for 2 
days. Cryosection imaging was performed using a Leica SP5 confocal microscope. 
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Analysis was performed either manually or by using IMARIS software (Bitplane) 
or the Leica LAS-AF (Version 2.7.3.9723) software. For western blot analysis of 
mitochondrial complexes, islet cells were isolated and FACS sorted in RIPA buffer. 
Cell lysates were resolved by SDS-PAGE, transferred to PVDF membrane (Biorad) 
and incubated with the following primary antibodies: anti-Rt/Ms Total OxPhos 
Complex Kit (Life technologies, 458099, 1:250) and 4-tubulin (Abcam, ab11316, 
1:10.000). For western blot analysis of Fltp and Nkx6.1 expression, tissues and Min6 
were dissociated in RIPA buffer. Cell lysates were resolved by SDS-PAGE, trans- 
ferred to PVDF membrane (Biorad) and incubated with the following antibodies: 
rabbit anti-Fltp** (1:500), goat anti-Nkx6.1 (R&D system, AF5857, 1:2.000) and 
mouse anti-7y-tubulin (Abcam, ab11316, 1:10.000). For western blot analysis of 
pJNK, NKX6.1 and PDX1 expression, human islet and EndoC-3 H1 were dissoci- 
ated in RIPA buffer. Cell lysates were resolved by SDS-PAGE, transferred to PVDF 
membrane (Biorad) and incubated with the following primary antibodies: Rabbit 
SAPK/JNK (56G8) (New England Biolabs, 9258, 1:1.000), rabbit SAPK/JNK- 
Phospho (New England Biolabs, 4668, 1:1.000), goat anti-NKX6.1 (R&D Systems, 
AF5857, 1:2.000), rabbit anti-PDX1 (New England Biolabs, 5679, 1:500) and mouse 
anti GAPDH (Merck Bioscience, CB 1001, 1:5.000-10.000). Protein bands were 
visualized using the following horseradish peroxidase (HRP)-conjugated antibody: 
Goat anti-mouse HRP (Dianova, 115-036-062, 1:10.000), rabbit anti goat HRP 
(Dianova, 305-035-045, 1:5.000), goat anti rabbit HRP (Dianova, 111-036-045, 
1:5.000) and chemiluminescence reagent (Millipore). The bands were quantified 
using Image]. 

Fluorescent intensity analysis. Nkx6.1 and MafA fluorescent intensity analysis 
of Min6 was performed with Imaris software, using the spot function. Spots of the 
same size were chosen and the mean fluorescence within these spots was calculated. 
The median fluorescent intensity was used as a threshold to distinguish between 
Nkx6.1-high and -low cells. 

To show the fluorescent intensity profiles a line was drawn and the fluorescence 
intensity over this line calculated and plotted against the length of the line. 

To calculate the Fltp reporter fluorescent intensity of the BABB cleared pan- 
creata Imaris software was used. A surface on the Nkx6.1 positive cells was calcu- 
lated and the mean Fitp reporter fluorescent intensity was calculated within the 
described surfaces. 

Nkx6.1, Venus, EdU and p27 fluorescent intensity analysis of pancreatic islet 
sections were performed with Imaris software, by using the surface function. 
Surfaces using the same threshold were rendered for the Nkx6.1 positive cells 
and the mean fluorescence intensity within these surfaces was calculated for 
Venus, EdU and p27. To discriminate between p27-high and -low cells, the total 
mean fluorescent intensity of all islets of one mouse was used as a threshold. The 
threshold for EdU was set by eye. 

Mitochondria network analysis and oxygen consumption rate. The mitochon- 
dria network was analysed using Imaris. The mitochondrial surface was rendered 
by thresholding the Mitotracker Deep Red FM (Thermo Fisher Scientific) intensity. 
The analysis was performed on nine Nkx6.1-Venus double-positive cells and on 
four Nkx6.1 single-positive cells from two islets cultured in culture medium and 
on nine Nkx6.1-Venus double-positive cells and on six Nkx6.1 single-positive cells 
from three islets exposed to 2mM glucose for 2h. For the flux analysis experiment, 
FVR* and FVR™ B-cells were sorted and recovered. Around 30,000 single cells 
were plated in a 96-well plate for flux analysis using 2 jul Growth-Factor-Reduced 
Matrigel Matrix (BD Bioscience) diluted 1:2 in medium per well and rested over- 
night. The cells were starved for 2h in Hank’s Salt (HBSS) supplemented with 2mM 
glucose and 2mM glutamax (Thermo Fisher Scientific). Oxygen consumption 
was measured using an XF96 extracellular flux analyser (Seahorse Bioscience) 
with sequential injections of 13 mM glucose, 41M oligomycin (Sigma) and 101M 
antimycin + 4|1M rotenone (Sigma). 

FACS analysis, gene profiling and endocrine cell reaggregation. For gene pro- 
filing, qPCR, transmission electron microscopy and western blotting, cells from 
isolated islets were sorted and analysed using FACS-Aria III (BD Bioscience). The 
percentage and purity of both endocrine populations were controlled by cytospins 
and quantitative PCR (qPCR) and 90% purity of Nkx6.1* B-cell subpopulations 
was achieved using an optimized FACS sorting scheme. For FACS analysis, cells 
were fixed in 4% PFA and intracellular staining for Nkx6.1 was performed by 
using goat anti-Nkx6.1 (R&D Systems, AF5857, 1:100) and donkey anti-goat APC 
(Dianova, 705- 136147, 1:1.000). The ratio of Fltp lineage* and Fltp lineage” pop- 
ulations during pregnancy was calculated over the Nkx6.1* cells. The results were 
analysed using FlowJo software. For qPCR on sorted populations, total RNA was 
extracted from FACS-sorted endocrine cells, using a miRNeasy micro kit (Qiagen) 
and amplified with the Ovation PicoSL WTA System. For gene profiling, total RNA 
was extracted from FACS-sorted FVR endocrine cells, using a miRNeasy micro 
kit (Qiagen) and amplified with the Ovation PicoSL WTA System V2 in combi- 
nation with the Encore Biotin Module (Nugen). Amplified cDNA was hybridized 


on Affymetrix Mouse Gene 1.0 ST arrays. Staining and scanning were performed 
according to the Affymetrix expression protocol including minor modifications 
as suggested in the Encore Biotion protocol. Expression console (v.1.3.0.187, 
Affymetrix) was used for quality control and to obtain annotated normalized 
RMA (robust microarray analysis) gene-level data (standard settings including 
median polish and sketch-quantile normalization). TaqMan qPCR was assessed 
using the probe list provided in Supplementary Table 5, according to the manu- 
facturer’s instructions. For quantification of mitochondrial DNA copy number, 
DNA from sorted cells was extracted using Qlamp DNA Blood Mini kit (QiAgen) 
according to the manufacturer's instructions. The ratio of mtDNA to nDNA was 
determined by Syber green qPCR (Dynamo Flash Syber green, Byozym) using the 
primers listed in Supplementary Table 5. For endocrine cell re-aggregation, FVR* 
and FVR -enriched 3-cells were FACS sorted and cultured by shaking in DMEM 
(containing 4.5 g/l glucose, 5% FCS and 1% v/v penicillin/streptomycin) diluted 
1:1 in conditioned medium (DMEM containing 4.5 g/l glucose, 5% FCS and 1% 
v/v penicillin/streptomycin) from MS1 cells (pancreatic islets endothelial cell line, 
ATCC CRL-2279, LGC Standards GmbH). Cells were aggregated for 6 days; 101M 
Y-27632 (Santa Cruz) was added on day 1 and left for the first 3 days, at which point 
the medium was changed. One day before the GSIS assay, the cells were cultured in 
DMEM (containing 11 mM glucose, 5% FCS and 1% v/v penicillin/streptomycin). 
GSIS assays and pancreatic insulin and glucagon concentration. Isolated murine 
islets were allowed to recover overnight in islet medium (DMEM containing 
11mM glucose, 10% v/v FBS, 1% v/v penicillin/streptomycin). For the static GSIS 
assay, islets of the same size were handpicked and assayed with different glucose 
concentrations as previously described’. For static GSIS assays on endocrine 
aggregates, modified Krebs Ringer phosphate Hepes (KRPH) buffer supplemented 
with 0.1% BSA was used. The aggregates were incubated for 30 min in KRPH with 
2.8mM glucose and sequentially incubated with different glucose concentrations 
as indicated. For static GSIS assays on human microislets, sequential incubation 
with 2.8mM glucose and 16.8 mM glucose in KRPH buffer was assessed for 2h 
each. Assessment of pancreatic insulin and glucagon concentration was performed 
by acid ethanol extraction. Murine insulin and glucagon concentrations were 
determined using Ultrasensitive Insulin ELISA kit (Cristal Chem) and Mouse 
Glucagon ELISA Kit (Cristal Chem), respectively, and normalized over protein 
content. Human insulin concentrations were determined using an Ultrasensitive 
Insulin ELISA kit (Mercodia). 
Association analysis. The study population consisted of 2,228 white individuals 
at risk for type 2 diabetes (family history of type 2 diabetes, body mass index 
(BMI) > 27 kg/m?, impaired fasting glycaemia and/or previous gestational 
diabetes) recruited from the ongoing Tiibingen Family study for type 2 diabetes. 
All participants underwent assessment of medical history, smoking status and 
alcohol consumption habits; the subjects furthermore agreed to undergo physical 
examination, routine blood tests and oral glucose tolerance tests (OGTTs). Only 
individuals with complete phenotypic and genotypic data sets and documented 
absence of medication known to influence glucose tolerance, insulin sensitivity 
or insulin secretion were included. All study participants gave informed written 
consent to the study and protocols adhered to the Declaration of Helsinki. 

The study protocol was approved by the Ethics Committee of the Eberhard 
Karls University, Tiibingen. 
OGTT and laboratory measurements. A standardized 75 g OGTT was performed 
following a 10h overnight fast. For the determination of plasma glucose, insulin 
and C-peptide levels, venous blood samples were drawn at baseline and at time- 
points 30, 60, 90 and 120 min of the OGTT”. Plasma glucose levels (in mmol/l) 
were measured with a bedside glucose analyser (glucose oxidase method, Yellow 
Springs Instruments). Plasma insulin and C-peptide levels (in pmol/l) were deter- 
mined by commercial chemiluminescence assays for ADVIA Centaur (Siemens 
Medical Solutions). BMI was calculated as weight divided by squared height (in 
kg/m’). OGTT-derived insulin sensitivity was estimated as proposed earlier’): 
10,000/[c(Glco)*c(Inso)*c(Glemean)*¢(InSmean)]42 (with c= concentration, 
Glc= glucose and Ins = insulin). OGTT-derived insulin secretion was estimated 
as area under the curve (AUC) Cpepo-30/AUC Glco_39 according to the formula: 
[c(Cpepo)-+c(Cpep3o) |/[c(Gleo)+c(Gle39)] (Cpep = C-peptide). 
Selection of tagging SNPs and genotyping. Based on publicly available data 
from the 1000 Genomes Project (http://browser.1000genomes.org/index.html), 
we analysed in silico a genomic area on human chromosome 1q23.3 spanning the 
CFAP126 gene (3.143 kb, five exons, four introns, located on the reverse strand) 
and 2kb of the gene’s 5/-flanking region. Within the analysed CFAP126 locus, 
36 SNPs were found. Using the tagger analysis tool of Haploview (see http:// 
www.broadinstitute.org/scientific-community/science/programs/medical-and- 
population-genetics/haploview/%20haploview), seven tagging SNPs were 
identified that cover all the other common SNPs (minor allele frequency > 0.01) 
with an r? > 0.8. These SNPs were rs17399583 (C/T), rs11584714 (C/G) and 
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1857835711 (C/G) in the 5'-flanking region, rs114482063 (A/T) and rs182840301 
(C/A) in intron 1 and rs16832872 (G/A) and rs75715534 (C/G) in intron 3. 
For genotyping, DNA was isolated from whole blood using a commercial kit 
(NucleoSpin, Macherey & Nagel). The tagging SNPs were genotyped using the 
mass spectrometry system massARRAY from Sequenom and the manufacturer's 
iPLEX software (Sequenom). The call rates were > 96.1%. The mass spectrometric 
results were validated in 50 randomly selected subjects by bidirectional sequencing 
and both methods gave 100% identical results. 

RNA sequencing expression data analysis of human pancreatic islets. RNA 
sequencing expression data on human pancreatic islets from m= 119 donors 
including n = 14 diagnosed type 2 diabetes (T2D) donors were obtained from the 
Human Tissue Laboratory (HTL) of Lund University Diabetes Centre (LUDC)”. 
Gencode v14 RefSeq was used for gene annotation. Across-samples normaliza- 
tion was performed using the TMM normalization method’. HbA1c informa- 
tion available for 106 human pancreatic islets donors was used for classification 
into a) healthy/non-diabetic donors (normal glucose tolerance, NGT, n= 66, 
HbAIc <6), b) pre-diabetic donors (impaired glucose tolerance, IGT, n=21, 
6 < HbAIc<6.5) and c) diabetic donors (T2D, n= 19, diagnosed + HbAIc > 6.5). 
Associations of gene expression in human pancreatic islets with blood glucose 
level (NGT+IGT+T2D status) was performed with ANOVA for linear testing 
and using the Kruskal-Wallis method for nonlinear and non-parametric testing. 
Correcting for multiple testing was performed with FDR procedure and 5% sig- 
nificance threshold was selected for the analysis. All calculations were done using 
R language for statistical computations. The upper whisker of the boxplots 
represents min (max (x), Q_3 + 1.5 * IQR) and the lower whisker max (min(x), 
Q_1—1.5* IQR). IQR represents the box length Q_3 — Q_1. 

Statistical analysis. No statistical methods were used to predetermine sample size. 
The experiments were not randomized and the investigators were not blinded 
to allocation during experiments and outcome assessment. Statistical analysis 
was performed using GraphPad Prism 6 Software (GraphPad Software, USA). 
Values were compared using unpaired or paired t-tests or ANOVA as indicated for 
each experiment. P values of <0.05 were considered statistically significant. Data 
are expressed as means + s.e.m. unless otherwise specified. Statistical analysis of 
microarray profile was performed using the statistical programming environment 
R (R Development Core Team) implemented in CARMAweb. The P value of the 
limma t-test was used as a criterion for significance (P< 0.01) and in addition 
filtered for fold-changes >1.5 to potentially increase biological relevance. Multiple 
testing corrections were not applied, because with only two replicates this approach 
would be too stringent (the most significant gene after Benjamini-Hochberg 
correction had a false discovery rate <22%). Heatmaps were generated with 
CARMAweb* and cluster dendrograms with the R script hclust. GO term enrich- 
ments were performed for 1.5-fold regulated genes with a P-value <0.005 using 
the GePS module in the Genomatix Software Suite v3.1 (Genomatix). The pathway 
analyses (P < 0.05) were generated through the use of QIAGEN’s Ingenuity Pathway 
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Analysis (IPA, QIAGEN Redwood City, http://www.qiagen.com/ingenuity). 
For association analysis, continuous variables with non-normal distribution were 
log.-transformed before statistical analysis. Multiple linear regression analysis 
was performed using the least-squares method. In the regression models, insulin 
secretion was chosen as an outcome variable, the SNP genotype (in the dominant 
inheritance model) as an independent variable and gender, age, BMI and insu- 
lin sensitivity as confounding variables. SNP-BMI interaction effects on insulin 
secretion were tested by ANCOVA with gender, age and insulin sensitivity as con- 
founding variables. When testing all seven tagging SNPs in parallel, a Bonferroni- 
corrected P-value <0.0073 was considered statistically significant. For all analyses, 
the statistical software JMP 8.0 (SAS Institute) was used. 
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Extended Data Figure 1 | FVR expression is induced in compacted islet 
structures and correlates with high Nkx6.1 expression. a, Overview 

of a cleared whole-mount pancreas from an Fitp2¥/* mouse at E18.5, 
stained with anti-Nkx6.1 and anti-GFP antibodies and scanned with a 
laser confocal microscope. b-e, In compacted islets (dotted line), high 
FVR expression correlates with high Nkx6.1 expression. f-i, In cord-like 
structures of endocrine cells that left the ductal epithelium (dotted 

line), low Nkx6.1 expression correlates with low FVR expression. 

j, Quantification of the mean fluorescent intensity of Venus (FVR)t 
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and Nkx6.1* cells in compacted and cord-like structures (n (mice) = 3, 5 
cord-like structures, 6 compacted islets; ***P = 0.0005, two sided unpaired 
t-test). k, Quantification of Nkx6.1 (red) and Venus (green) single- 
positive cells as well as all Nkx6.1-Venus double-positive cells (yellow) in 
compacted and cord-like structures (m (mice) = 3, cord-like structures 570 
cells, compacted islets 1,827 cells; **P = 0.0064 for Venus, **P =0.0041 
for Nkx6.1, *P=0.0182 for Venus—Nkx6.1, two-sided unpaired t-test). 
Scale bars, 100 1m (a), 201m (b-i). Data represent mean + s.e.m. 
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Extended Data Figure 2 | FVR~ and FVR* @-cell subpopulations 
exhibit differential proliferative capacity and inter-islet heterogeneity 
in the head and tail of the pancreas. a, b, LSM images showing FVR, 
Nkx6.1 and EdU (a) and quantification (b) of EdUt in FVR~ and FVRt 
B-cell subpopulations in control and pregnant (G15.5) mice (green/red 
arrow heads indicate proliferating FVR*/FVR™ (-cells. n (control) = 4 
(8,500 Nkx6.1* cells), n (pregnant) = 4 (10,938 Nkx6.1* cells); **P 
(Ctrl) = 0.0049, **P (pregnant) = 0.0059, two-sided unpaired f-test). 

c, Scatter plot of FVR* 6-cell distribution relative to number of 8-cells 
per islet in Fltp2”’* mice (n (mice) = 10, n (islets) = 512). d, Quantification 
of islet size distribution in head and tail of Fltp”’* pancreas sections 

(n (mice) = 4, n (islets) = 299; *P (20-50) = 0.0419, *P (>100) = 0.0238, 
two-sided unpaired t-test). e, f, LSM images (e) and quantification (f) 

of FVR* B-cell distribution in the pancreas head and tail (n (mice) = 4, 


n (islets) = 299, n (Nkx6.1* cells) = 19,054; ****P < 0.0001, two-sided 
unpaired t-test). g, Quantification of EdU incorporation among FVR™ 
and FVR* {-cell subpopulations relative to number of 8-cells per islet 

in pregnant (G15.5) Fitp2”’* mice (n (mice) =4, n (islets) = 196, 

n (Nkx6.1* cells) = 10,938). h, LSM images showing Nkx6.1-, p27- 

and FVR-expressing cells in islets of Fltp2”’* pregnant (G15.5) mice. 

i, Quantification of p27 and Venus fluorescent intensity shows that 
p27-high cells have a higher Venus fluorescent intensity than p27-low 
cells in pregnant (G15.5) mice ( (mice) = 3, m (p27 and Venus positive 
cells) = 4,204). j, Quantification of Venus and EdU fluorescent intensity 
shows that EdU-positive cells have lower Venus fluorescent intensity than 
EdU-negative cells in pregnant (G15.5) mice (n (mice) = 3, n (p27*, EdUT 
and Nkx6.1* cells) =75, n (p27* EdU~ Nkx6.17 cells) = 4,845). Scale bars, 
50m (a, e, h). Data represent mean + s.e.m. 
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Extended Data Figure 3 | See next page for caption. 
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Extended Data Figure 3 | Validation of the sorting scheme and 
molecular and cellular analysis of FVR~ and FVR* endocrine 
subpopulations. a, b, Quantification (a) and LSM images (b) of FACS- 
sorted FVR populations for the microarray analysis show equal numbers 
of endocrine cell lineages among the subpopulations (n (DAPI* 

cells) = 789). c, Detection and purification of four distinct endocrine 
subpopulations (I-IV) on the basis of granularity and Venus expression. 
d, e, Quantification (d) and LSM images (e) of sorted FVR subpopulations 
Tand II confirms high enrichment of Nkx6.1* -cells whereas III and IV 
exhibit high enrichment of glucagon* «-cells (n (DAPI* cells) = 3,275). 

f, Expression analysis of endocrine hormones and the exocrine marker 
Amy2a3 reveals minimal contamination with other endocrine cell types 
(n (independent experiments) = 6 (FVR*) and 5 (FVR_)). g, h, Validation 


of the microarray analysis by real-time qPCR of 8-cell-enriched 
subpopulations (I, II) for 6-cell maturation, mitochondria and metabolic 
signalling pathway genes (Atp5b, Hadh, Wnt4, Grb10, Wnt5b, Ins1, 

GFP, Ftp, Pcsk1, Gipr, Mafa: n (independent experiments) = 6; Bace2, 
Alpk1, Npy: n (independent experiments) = 5, Slc2a2: n (independent 
experiments) = 3; *P < 0.05, **P< 0.01, ***P < 0.001, two-sided unpaired 
t-test). i, Real-time qPCR of mtDNA (S12) versus nDNA (HBB) from 
8-cell enriched populations (I, II) (n (independent experiments) = 5, 
expression in the FVR* subpopulation normalized on each FVR™ 
subpopulation, error bars represent c.i.). j, Scatter plot of regulated 
polarity genes among FVR endocrine cells in microarray analysis (thin 
lines mark 1.5-fold difference in expression, P < 0.05, limma t-test). Scale 
bars, 100m (b, e). Data represent mean + s.e.m. except where stated. 
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Extended Data Figure 4 | Transmission electron microscopy (TEM) 
analysis of Fltp~ and Fltp* 6-cell subpopulations and quantification 
of mitochondria membrane length. a, b, TEM images of immunogold- 
labelled 3-cells of Fltp??4“"’*;Gt(ROSA)26™"@/* mice reveal no 
structural differences in Golgi complex (a) or endoplasmic reticulum (b). 
c, d, TEM images and detected areas of the algorithm used for mitochondria 
analysis (c) and analysis of inner mitochondria membrane length 


LETTER 


Fitp lineage" 


Inner membrane length [nm]/ 
mitochondrial area [nm] 


normalized on the mitochondria area in FVR* and FVR’ B-cells (d) 

(n (independent experiments) = 2, n (FVR* cells) = 14, n (FVR™ 

cells) = 16, n (mitochondria) = 544). e, TEM images of mitochondria in 
FVR* compared to mitochondria in FVR™ {-cells. Scale bars, 500 nm 

(a (Golgi complex and gold staining), c, e), 200 nm (a (insulin granules), 
b). Data represent mean. 
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Extended Data Figure 5 | See next page for caption. 
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Extended Data Figure 5 | Analysis of oxygen consumption rate (OCR), 
mitochondria network and GSIS in FVRt and FVR™ @-cells. a, OCR 
of FACS-sorted 8-cell-enriched FVR* and FVR™ subpopulations after 
Antimycin A subtraction. b, Quantification of absolute OCR rates of 
FACS-sorted (-cell-enriched FVR* and FVR™ subpopulations after 
Antimycin A subtraction. FVR* n (technical replicates) =9, FVR7 

n (technical replicates) = 4, for each technical replicate 3 x 10* cells 

were plated; endocrine cells were isolated from 9 mice; two-sided 
unpaired t-test. c, OCR of FACS-sorted (-cell-enriched FVR* and 

FVR™ subpopulations after Antimycin A subtraction and baseline 
normalization. d, Quantification of relative OCR of FACS-sorted B-cell- 
enriched FVR* and FVR™ subpopulations after Antimycin A subtraction 
and baseline normalization. FVR* n (technical replicates) = 9, FVR— 

n (technical replicates) = 4, two-sided unpaired t-test. e, h, LSM pictures 
of immunohistochemistry of isolated Fltp2”/* mouse islets cultured under 
high (16.5 mM) and low (2 mM) glucose concentrations with single-cell 


LETTER 


reconstructions (Imaris) showing the reconstructed membrane (light 
brown), the nuclei (white, green and blue) and the mitochondria (red). 

f, g, i,j, Enlargement of a single Nkx6.1* cell under high (f) and low 
glucose (i) and an Nkx6.1* FVR* cell under high (g) and low glucose (j) 
with reconstruction of the nucleus, the membrane and the mitochondria. 
k, Quantification of the mitochondrial morphology of Nkx6.1-FVR 
double-positive cells (yellow) and Nkx6.1 single-positive cells (white) 
under high and low glucose stimulation (n (Nkx6.17 FVR*) =9, 

n (Nkx6.1* FVR7™ in high glucose) = 4, n (Nkx6.1* FVR7 in low 
glucose) = 6, from three islets for low glucose and two islets for high 
glucose). 1, FACS-sorted and re-aggregated }-cell-enriched subpopulations 
reveal differences in GSIS and arginine depolarization (n (independent 
experiments) = 3, n (aggregates FVR™) = 16, n (aggregates FVR7) = 12; 
*P=0.0206, **P= 0.007, two-sided unpaired t-test). Scale bars, 10 1m 
(e, h), 241m (f, j), 3j1m (g, i). Data represent mean + s.e.m. 
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Extended Data Figure 6 | Fltp genetic lineage tracing of endocrine GTT by AUC quantification (n (mice on normal diet) = 4, n (mice on 
subpopulation dynamics upon islet transplantation and HFD. HFD) =5; *P=0.0223, **P=0.0064, Sidak’s multiple comparison test). 
a, Maximum intensity projection of laser backscatter to determine the d, Longitudinal quantification of the relative volumes of islet size 
volume of the same transplanted Fitp!?“""/*;Gt(ROSA)26""°’* islet and vessel volume during engraftment time (n (islets) = 15 (from 
before (Pre-diet) and after 8 weeks of HED. b, Quantification of the total 4 mice); ****P < 0.0001, ***P = 0.0027, two-sided unpaired t-test). 
islet volume (transplanted Ftp ?Aicre/ +;Gt(ROSA)26"!"°/*) in mice on e, Quantification of mG and mT endocrine cell numbers every 10 pm 
normal diet and HFD (n (islets) = 25 (from 4 mice on normal diet), in a confocal z-stack during engraftment (n (islets) = 15 (from 4 mice); 
n (islets) = 34 (from 5 mice on HFD); ****P < 0.0001, Sidak’s multiple **** P< 0.0001, two-sided unpaired t-test). Scale bars, 50,1m (a). Data 


comparison test). c, Quantification of glucose levels during intraperitoneal represent mean +s.e.m. 
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Extended Data Figure 7 | 3D architecture and Wnt/PCP induce 

B-cell maturation in Min6 cells. a, c—e, g, h, LSM images (a, e) and 
quantification (c, g (3 days culture), d, h, (7 days culture)) of Nkx6.1 

and MafA mean fluorescent intensity in Min6 cells in 2D and 3D culture 
(c, n (independent experiments) = 3, ***P = 0.0002; d, n (independent 
experiments) = 4, *P = 0.0264; g, n (independent experiments) = 2; 

h, n (independent experiments) = 3; Sidak’s multiple comparison test). 

b, f, Fluorescent intensity of Nkx6.1 (b) and MafA (f) in a single Min6 cell 
in 2D and 3D culture. White lines in a and e indicate measured regions. 

i, j, Gene expression analysis of Min6 cells cultured in 2D and 3D culture 


by qPCR (n (independent experiments) = 4; *P < 0.05, **P < 0.01, 

** P < (001, two-sided unpaired t-test). k, 1, LSM images (k) and 
quantification (1) of Nkx6.1+ Min6 cells in floating 3D culture treated 
with or without Wnt5a (n (independent experiments) = 3; n (cells) =5,419 
(control), 2,419 (Wnt5a); **P = 0.0036, Sidak’s multiple comparison test). 
m, n, Western blot (m) and quantification (n) of Nkx6.1 and Fltp protein 
in Miné6 after treatment for 6 days with or without Wnt5a in 2D 

or 3D culture (western blots were normalized on y-tubulin expression and 
standardized on 2D control, n (independent experiments) = 5). Scale bars, 
20m (a, e, k). Data represent mean +s.e.m. 
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Extended Data Figure 8 | Modulation of maturation and proliferation 
through WNT signalling in human microislets and EndoC-8 HI human 
8-cell line and SNP association with insulin secretion defects in the 
FLTP gene. a-e, LSM images (a-d; proliferating cells marked by arrow 
heads) and quantification (e) of NKX6.1-EdU double-positive EndoC-3 
H1 cells treated with or without WNT3A, WNT4 or WNT5A 

(n (independent experiments) = 3, 1,617 cells counted; *P= 0.019, 
wo-sided unpaired t-test). f, g, Western blot (f) and quantification 

(g) of pJNK in EndoC-8 H1 treated for 4h with or without WNT4 

(n (independent experiments) = 4; *P = 0.027, two-sided paired t-test). 
h, Fold change in insulin secretion in response to high glucose in 

human microislets treated with or without WNT4 or WNT5A (3 human 


donors; Ctrl: n (microislets) = 23, WNT4: n (microislets) = 25, WNT5A: 
n (microislets) = 26; *P = 0.0423, two-sided unpaired t-test). i, RNA 
sequencing expression data for SLC2A2 in human pancreatic islets from 
healthy donors (NGT, n = 66), pre-diabetic donors (IGT, n= 21) and 
type 2 diabetic donors (T2D, n = 19; one-way ANOVA). j, k, The minor 
allele of FLTP rs75715534 was significantly associated with increased 
insulin secretion in lean subjects (P= 0.0035), but not in obese subjects 
(P=0.07). Adjustment of the insulin secretion index AUC Cpepo_30/AUC 
Glco_30 (n = 2,228) was achieved by multiple linear regression modelling 
with gender, age and OGTT-derived insulin sensitivity as confounding 
variables. Scale bars, 251m (a, d). Data represent mean +s.e.m. (e, g, h); 
mean diamonds represent mean + 95% c.i. (j, k). 
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Extended Data Figure 9 | Knockout of Fitp has no effect on B-cell 
maturation or proliferation. a—d, Single plane LSM images of 
cryosections from adult Fitp2¥/* (a, c) and Fitp2¥/2V (b, d) mice. Some 
Venus (green), Nkx6.1 (red) and MafA (white; a, b, white arrowheads) 

or Ucn3 (white; c, d, dashed lines) triple-positive cells are highlighted. 

e, Quantification of Venus, Nkx6.1 and Ucn3 (white) or MafA (black) 
triple-positive cells shows no difference in maturation status of 6-cells in 
Fitp2'/* and Fitp2"/2" mice (Fltp2”"*: n (mice) = 3, MafA 2,349 cells, 

Ucn3 1,212 cells; Fitp2¥/2"; n (mice) = 3, MafA 867 cells, Ucn3 1,825 cells). 
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f-i, LSM images of immunohistochemistry on cryosections from P11 and 
adult Fltp */* (£, P11; h, adult) and Fltp2/2V (g, P11; i, adult) mice showing 
the proliferation of islet cells. j, k, Quantification of proliferating B- and 
non-$-cells in Fitp’2" and Fitp*’* mice at P11 (j) (Fitp*’*: n (mice) =4, 
6,556 B-cells and 4,325 non-(6-cells; Fitp2¥/2V. n (mice) = 4, 6,709 6-cells 
and 5,868 non-$-cells) and in adults (k) (Fltp*/*: n (mice) =7, 9,803 


Non-f-cells 


B-cells and 4,338 non-(-cells; Fitp2"/2V: n (mice) = 9, 14,957 B-cells and 
7,955 non-(-cells). Scale bar, 20 j1m (a), 10j1m (b), 541m (c, d), 151m (f-i). 
Data represent mean s.d. 
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Extended Data Figure 10 | Analysis of the metabolic phenotype of (d) in pancreatic tissue from Fltp2"/2" and Fltp*/* mice (Fltp*’* n 
Fitp®’/2Y mice. a, Quantification of B-cell number in islets of Langerhans (mice) =5 and Fitp"/2" n (mice) =5). e, ITT in Fitp2”’2" and Fltp*/* mice 
from Fitp*’* and Fltp~"/2" mice at P11 or adult (n (mice) =3 (P11 Fltp’’*, (Fltp*’* n (mice) =5 and Fitp"/2" n (mice) = 3). f, Plasma insulin level 
6,556 Nkx6.1* cells and Fitp?”’2", 6,791 Nkx6.1* cells), n (mice) =6 in fasted Fltp*’* and Fitp"/2" mice (Fltp*/* n (mice) = 11 and Fitp2"/2” 
(adult Fitp*’*, 3,202 Nkx6.1* cells) and n (mice) = 4 (adult Fitp2"/2V, n (mice) = 10; *P = 0.0405, two-sided unpaired t-test). g, GSIS in isolated 


6,263 Nkx6.1* cells)). b, Intraperitoneal GTT in Fltp2”’2" and Fltp*’/* mice __ islets from Fltp*/* and Fitp*”’”" mice (Fltp’’* n (mice) =5 and Ftp?” 
(Fltp*/* n (mice) =7 and Fitp2”/2" n (mice) =7; **P = 0.0022, two-sided 2V'n (mice) =5; *P = 0.0263, one-way ANOVA, Bonferroni’s multiple 
unpaired t-test). c, d, Concentration of glucagon (c) and insulin comparison test). Data represent mean + s.e.m. 
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Mobile genes in the human microbiome are 
structured from global to individual scales 


1. L. Brito!?, S. Yilmaz**, K. Huang*, L. Xu?*, S. D. Jupiter*, A. P Jenkins®, W. Naisilisili*, M. Tamminen®’, C. S. Smillie, 
J.R. Wortman?, B. W. Birren?, R. J. Xavier*®?, P. C. Blainey?, A. K. Singh®, D. Gevers? & E. J. Alm!*° 


Recent work has underscored the importance of the microbiome in 
human health, and has largely attributed differences in phenotype 
to differences in the species present among individuals!~>. However, 
mobile genes can confer profoundly different phenotypes on 
different strains of the same species. Little is known about 
the function and distribution of mobile genes in the human 
microbiome, and in particular whether the gene pool is globally 
homogenous or constrained by human population structure. Here, 
we investigate this question by comparing the mobile genes found 
in the microbiomes of 81 metropolitan North Americans with those 
of 172 agrarian Fiji islanders using a combination of single-cell 
genomics and metagenomics. We find large differences in mobile 
gene content between the Fijian and North American microbiomes, 
with functional variation that mirrors known dietary differences 
such as the excess of plant-based starch degradation genes found in 
Fijian individuals. Notably, we also observed differences between 
the mobile gene pools of neighbouring Fijian villages, even though 
microbiome composition across villages is similar. Finally, we 
observe high rates of recombination leading to individual-specific 
mobile elements, suggesting that the abundance of some genes may 
reflect environmental selection rather than dispersal limitation. 
Together, these data support the hypothesis that human activities 
and behaviours provide selective pressures that shape mobile 
gene pools, and that acquisition of mobile genes is important for 
colonizing specific human populations. 

Bacteria rapidly evolve and adapt to changing environments by 
acquiring new genes from other bacteria in their environments. 
Comparison of reference genomes from the microbiomes of individuals 
living in different countries around the world has shown that geography 
does not present a substantial barrier to gene flow®. Moreover, there are 
specific examples of genes that have swept through global populations, 
such as the antibiotic resistance gene New Delhi metallo-beta-lactamase 1 
(ref. 7). These observations raise the possibility that the mobile gene 
pool may be uniform across continents despite the existence of large 
differences in the compositions of culturally distinct microbiomes*’. 

By contrast, there is also evidence that gene pools may be population- 
specific. First, horizontally transferred genes often provide a selective 
advantage to the host organism®!"!. For example, seaweed in the 
traditional Japanese diet is believed to have led to selection for algal 
polysaccharide degradation genes that are underrepresented in North 
American populations’. Similarly, dental fillings containing mercury 
have led to an increase in mercury-resistance genes’*. The use of clinical 
and agricultural antibiotics has resulted in the propagation of resist- 
ance genes in both pathogenic and commensal gut microbes'*!°. Other 
cultural practices, including religious practices, travel, food origin and 
preparation, and beauty and hygiene product usage, may also shape 
the mobile gene pool. 


Here, we combine single-cell genomics with metagenomics to survey 
the mobile gene pool and investigate whether mobile genes are mainly 
globally distributed or population-specific. We compared the mobile 
gene pools of 81 participants in the US-based Human Microbiome 
Project (HMP)'° (Supplementary Table 1) with those of 172 Fijian 
islanders participating in the Fiji Community Microbiome Project 
(Fiji COMP) (Supplementary Table 2). FijiCOMP represents the first 
terabase-scale metagenomic view of the microbiome in the developing 
world. 

Cataloguing the mobile gene pool in a large cohort using short-read 
metagenomic sequences is difficult, so previous analyses have been 
constrained to individual species!!!” or specific mobile elements such 
as plasmids'®*° and phages*!*. One reliable method for cataloguing 
mobile genes, including integrated transposons and prophages, 
depends on assembled genomes and is based on identifying identical 
or nearly identical genes present in distantly related bacterial hosts®. 
We used this method to identify 15,585 mobile genes from the 387 
HMP gut microbiome reference and draft genomes” (Supplementary 
Tables 3, 5). Similar to complete genomes, even draft single-cell 
genomes provide enough context to link genes to hosts and to identify 
mobile genes. We used 180 single-cell genomes, derived from seven 
FijiCOMP participants, to identify an additional 22,268 mobile genes 
(Supplementary Tables 4, 5 and Extended Data Fig. 1). We then inves- 
tigated which metagenomic libraries contained reads that mapped to 
this set of mobile genes to survey their abundances across individuals 
and populations (Extended Data Fig. 2). 

Most mobile genes (62.4%) were present to some extent in both study 
populations, consistent with the previous finding that horizontal gene 
transfer (HGT) is not strongly structured by geography. Surprisingly, 
however, the abundance of those genes across populations was notice- 
ably distinct (Extended Data Fig. 3a; P < 10>°; permutational mul- 
tivariate analysis of variance (PERMANOVA) based on Bray—Curtis 
dissimilarity, 10° permutations), an observation that holds true even 
when considering a subset of the most stringently defined mobile genes 
(Extended Data Fig. 3b; P< 10-°, PERMANOVA based on Bray—Curtis 
dissimilarity, 10° permutations). Surveys of organisms endogenous to 
each population are required to fully observe both mobile gene pools 
(Extended Data Fig. 3c). We reasoned that diet might be a strong factor 
in differentiating the mobile gene pool across populations because 
many of the most highly consumed Fijian food items (taro, cassava 
breadfruit, coconut, and certain seafood) are not widely consumed in 
the US (see Methods). 

We specifically examined genes encoding glycoside hydrolases 
as these include substrate-specific dietary enzymes. In the Fijian 
microbiomes, we found a high abundance of mobile genes from the 
glycoside hydrolase GH13 family (Fig. 1a, Supplementary Table 6 and 
Extended Data Fig. 4), which encompasses most of the dietary starch 
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Figure 1 | Enrichment of functional mobile genes is locale-specific. 

a, Prevalence and abundance of all of the annotated mobile glycoside 
hydrolase families present in the FijiCOMP (n =172, red) and HMP 
(n=81, blue) metagenomic stool samples. Abundances were measured in 
fragments per kilobase of protein coding sequence per million mapped 
reads (FPKM) for each of the horizontally transferred genes, aggregated 
according to glycoside hydrolase family and plotted as a function of the 
density across samples. For each glycoside hydrolase family, the number 
of unique horizontally transferred genes present across the two cohorts is 
plotted (in grey), as are the sources of their substrates. b, Prevalence and 


degradation enzymes”. By contrast, glycoside hydrolases that degrade 
animal- and fungus-derived glycans exhibited bimodality between the 
American and Fijian cohorts, which may represent different dependen- 
cies on farm animals versus seafood that predominate in each of these 
cultures’ diets. We confirmed the dietary differences between the two 
cohorts directly by metagenomic sequencing of stool samples, as the 
metagenomes of Fijians harboured significantly higher levels of plant 
matter (P < 107}, Mann-Whitney test; Fig. 1b). Thus, differences in 
diet-related genes in the microbiome, which have also been observed 
in previous cross-cultural comparisons””», may be due in part to differ- 
ential abundances of mobile genes, rather than to the presence of genes 
that can be attributed to specific taxa. There are likely to be additional 
non-mobile glycoside hydrolase genes that differ across populations. 
GH13 family genes were indeed enriched in the overall microbiomes of 
the FijiCOMP cohort versus the HMP cohort (see Methods), although 
we were unable to determine what fraction of these are mobile. The 
advantage of looking at mobile genes is that it narrows our focus to 
specific genes rather than species-level effects that might be more 
difficult to link directly to environmental factors such as diet. 

We next hypothesized that antibiotic resistance genes might differ 
across populations, reflecting the high usage of beta-lactams and 
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abundance of plant matter (read alignments to rRNA from the kingdom 
Viridiplantae) in the metagenomes of stool samples from the FijiCOMP 
(red) and HMP (blue) populations. c, Prevalence and abundance of 
annotated mobile antibiotic resistance genes across the FijiCOMP (red) 
and HMP (blue) metagenomic stool samples. P value based on Mann- 
Whitney test. d, Prevalence and abundance of eight village-specific mobile 
genes (of 31 total village-specific genes) across four Fijian villages. q values 
(a, b, d) of prevalence comparisons based on FDR-corrected Fisher's exact 
tests; q values of abundance comparisons based on FDR-corrected Mann- 
Whitney tests. 


quinolones in Fiji (see Methods). We found that quinolone resistance was 
more pervasive in the Fijian cohort than the American cohort (Fig. 1c 
and Supplementary Table 7). Resistance to cephalosporins, which have 
been introduced relatively recently, was primarily an American phe- 
nomenon. Despite limited access to the full range antibiotics in Fiji, 
resistance genes for most classes of antibiotics were found in the Fijian 
population, consistent with recent findings in other developing world 
communities”®. These results highlight our limited understanding of 
the forces that drive antibiotic resistance, and could reflect other uses 
of antibiotics, such as their use in agriculture, or even point to new 
classes of antibiotic resistance that have not yet been characterized’. 
As the distribution of mobile genes in the microbiome shows differ- 
ences at a global scale, we hypothesized that gene distributions varied 
at even finer scales of resolution. The FijiCOMP cohort provides a 
unique opportunity to look at fine-scale structure because it includes 
individuals living in three villages in the remote Bua province and 
a fourth village in the agricultural Macuata province. Therefore, we 
checked whether any genes were specifically associated with one or 
more villages. Notably, we identified many village-associated mobile 
genes (31 genes varied significantly according to abundance, based on 
a Mann-Whitney test of gene abundances using FDR with q < 0.05, 
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Figure 2 | Microbiome composition across global and local populations. 
a, Principal coordinates analysis of the Jeénson—Shannon divergence 
between species compositions of the FijiCOMP (n= 172, red) and HMP 
(n=81, blue) metagenomic samples. b, Principal coordinates analysis 

of the Jenson-Shannon divergence between species compositions in the 
FijiCOMP metagenomic samples, according to village membership. 


or prevalence, based on a Fisher’s exact test of village association using 
FDR with q < 0.05) (Fig. 1d and Supplementary Table 8). Village- 
associated mobile genes include those encoding HGT machinery, 
such as proteins involved in plasmid segregation, as well as proteins 
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that may directly provide a fitness advantage to their host, such as 
ferredoxins. 

The identification of village-associated mobile genes was unexpected 
because individuals’ microbiomes are more similar among villages 
(P > 0.05, PERMANOVA based on Jensen-Shannon divergence, 10° 
permutations) than countries (P < 10°, PERMANOVA based on 
Jensen-Shannon divergence, 10° permutations) (Fig. 2, Extended Data 
Fig. 5 and Supplementary Table 9). Even though the mobile genes in our 
data set are defined by their occurrence in multiple phylogenetic back- 
grounds, gene exchange is known to be more frequent among closely 
related species®. We therefore investigated whether the village-associated 
genes were associated with species that were also partitioned unevenly 
across villages. For six of the 31 village-associated genes, we found a 
correlation with one of the ten village-associated species, although five 
of these six genes were also correlated with other non-village-associated 
species (see Methods). Overall, our findings suggest that these mobile 
genes are not restrained by their host genomes. 

This finding prompted us to investigate the extent to which species 
composition determines mobile gene pool composition. First, we 
tested whether mobile genes originally identified in Bacteroides 
genomes or Prevotella genomes were more common in the metagen- 
omes of individuals whose microbiomes were dominated by the same 
species. However, we did not find a significant association between 
the dominant taxon in an individual and their likelihood of carrying 
mobile genes of similar origins. Next, we investigated the taxonomy of 
bacteria hosting specific mobile elements by examining paired reads 
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that span junctions between mobile genes integrated next to evolu- 
tionarily conserved tRNAs. Out of 838 mobile genes found adjacent to 
tRNAs, 65 were found near tRNAs from more than one genera, with 
the most promiscuous occurring next to tRNAs from 25 different 
genera (Supplementary Table 10; examples shown in Extended Data 
Fig. 6). Even within a single individual, these genes were not tied to the 
dominant bacteria present within that individual’s microbiome, and 
often spanned multiple genera, including those in both Prevotellaceae 
and Bacteroidaceae families. Thus, horizontal gene transfer is likely to 
be driven by forces independent of species community composition. 

The high rates of recombination observed among mobile genes sup- 
port the idea that mobile genes are even less dispersal-limited than 
their host genomes. Following the logic of the Baas-Becking hypoth- 
esis that “everything is everywhere, but the environment selects,’ we 
might infer that environmental selection rather than dispersal drives 
differences in gene abundance among populations. Extending this logic 
from species to genes, however, is not straightforward because genes 
can change in abundance as a result of genetic ‘hitchhiking’ on mobile 
elements under selection at a different locus’”. We reasoned that if 
recombination rates are sufficiently high, then genes should be present 
in many contexts. On the other hand, if gene abundances are driven 
by selection on larger mobile elements, then genes should appear in a 
limited number of contexts. 

To investigate the genomic contexts of our mobile genes, we used 
the alignment and orientation of paired-end metagenomic reads to 
assemble short contigs encompassing each mobile gene (Fig. 3a—c). 
As expected, we found that recombination was limited, although not 
absent, within operons (as defined by adjacent genes in the same orien- 
tation and of the same functional category; Fig. 3d), and that horizontal 
transfer machinery, such as phage-, plasmid- or transposon-specific 
genes, had higher levels of recombination (Fig. 3e). However, despite 
the high prevalence of genes and broader gene functions, we found that 
34.9% of the gene contexts, defined as the set of unique combinations 
between adjacent genes that were observed, were specific to individuals; 
very few of these gene contexts were conserved across populations 
(Fig. 4 and Extended Data Fig. 7). 

Together, high recombination rates and population-specificity 
support the notion that environmental selection on individual genes, 
rather than dispersal limitation alone, plays a key role in driving gene 
abundances. The ‘everything is everywhere’ concept is reinforced for 
mobile genes by the observation that the majority of genes were found 
in both the HMP and FijiCOMP populations (Fig. 4). Even among the 
village-associated genes, all but five were found in all of the villages. 
Dispersal may play a role in shaping the distribution of a small, but 
abundant, subset of genes that are restricted to one population (that is, 
gene families (GH67, GH28 and GH110, Fig. 1a) or subsets of villages 
(Fig. le)). Nevertheless, environmental selection is underscored by the 
impact of diet on gene abundance. In fact, only one of the universal 
genes, defined as those present in more than 75% of both populations, 
was annotated as carbohydrate-metabolizing, despite carbohydrate- 
metabolizing genes being significantly enriched among population- 
specific genes, defined as those present in more than 50% of one 
population and less than 10% of the other (P < 10~°, Pearson's x’ test). 

How selection and dispersal affect gene exchange within physically 
proximal environments, that is, within a single individual's body sites, 
is still an open question®. On one hand, differences in composition 
and ecology may result in distinct mobile gene pools at various body 
sites®?®. On the other hand, the direct route between oral and gut 
communities should facilitate transmission of mobile elements, and 
systemic selective pressures, such as those imposed by orally adminis- 
tered antibiotics, may homogenize personal gene pools. Our analysis 
of saliva samples derived from FijiCOMP participants shows that there 
is little overlap between gut and oral mobile gene pools, as only 0.94% 
of the genes represented in the gut were detectable in any of the saliva 
samples. For any particular gene, there was no correlation between its 
presence or abundance between stool and saliva samples from the same 
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Figure 4 | Genes are widespread across global populations, but specific 
mobile genetic element architecture is not. The percentages of gene 
families (determined by functional annotations) (left), identical genes 
(middle), and gene contexts, defined by unique linkages between genes 
(right) are plotted according to their prevalence across the FijiCOMP and 
HMP populations. Gene families, genes and gene contexts are referred 

to as personal (present in a single individual), rare (in <5% of one 
population and absent in the other), enriched (in >50% of one population 
and <10% of the other), common (in > 25% of both populations), 
widespread (in > 50% of both populations) or universal (in > 75% of both 
populations). Colour reflects association with one or both populations. 


individual. These data support the hypothesis that shared selective pres- 
sures and common ecologies structure horizontal transfer, although 
they do not rule out the possibility that physical proximity plays a role. 

As well as presenting a data set describing the microbiome of a pop- 
ulation in the developing world, we have also described an approach 
that can be used to identify environmentally relevant genes. Previous 
shotgun metagenome approaches have focused on the abundance of a 
gene as a proxy for its importance. This approach will identify impor- 
tant genes but also has the potential to identify many spurious genes 
because a single highly abundant species can carry many genes that 
are not specifically relevant to the environment it occupies. Instead, 
we look for abundant genes that are present in multiple species, using 
horizontal gene transfer as an additional filter for gene importance. 
Our approach is subject to several important caveats. First, its sensi- 
tivity for detecting mobile genes is low, and there are likely to be many 
more genes transferred across species than we can detect. Second, even 
though each of the mobile genes within our data set exists in more than 
one species, some of those genes may be primarily associated with a 
single taxon. This is especially true if a single taxon is much more abun- 
dant than the other species that carry the gene. 

Despite these caveats, we found that the human-associated mobile 
gene pool differed between populations and carried gene functions that 
are likely to be associated with cultural practices; these findings provide 
functional insights not possible based on surveys of phylogenetic mark- 
ers, such as the 16S rRNA gene, alone. These insights have the poten- 
tial to improve public health at multiple levels. For example, a better 
understanding of the distribution of antibiotic-resistance genes among 
different populations’ microbiomes could inform antibiotic stewardship 
at a regional level, by avoiding the use of specific antibiotics where 
resistance to them is highest. We also showed that the mobile genetic 
elements are highly diverse among individuals, raising the possibility 
that they may vary within bacterial lineages in an individual over short 
time spans. If this is true, then diet and drugs could modify the func- 
tions of the microbiome, even if the long-term species composition is 
stable?*?°, To assess whether changes to cultural practices will influence 
human health via mobile genetic elements, future studies should test 
the speed at which selective pressures alter mobile gene frequencies 
within single individuals and larger populations. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Overview of the Fiji Community Microbiome Project. The Fiji Community 
Microbiome Project was developed to characterize the role of human-associated 
bacteria in health and disease from a developing world population recruited in 
the Fiji Islands. The study sought to understand the transmission of microbiome 
components among individuals and their environmental surroundings. The goal 
was to be as comprehensive as possible in the study villages. The study included 
300 individuals, regardless of health status, each of whom chose to provide any or 
all of stool, saliva or skin swab samples. To date, this is one of the largest cohorts 
on which metagenomic sequencing has been performed. Surveys were performed 
by visiting all households within each community. Individuals under the age of 
5 years were excluded, as were individuals who were deemed mentally incapable 
of providing informed consent. Informed consent was received from all partici- 
pants and parental consent was additionally required for all minors. IRB approval 
was received from Institutional Review Boards at Columbia University, the 
Massachusetts Institute of Technology and the Broad Institute and ethics approv- 
als were received from the Research Ethics Review Committees at the Fiji National 
University and the Ministry of Health in the Fiji Islands. Whole genome metagen- 
omic shotgun sequencing was performed on individuals’ stool and saliva samples, 
in addition to environmental samples from individuals’ proximal environments. 
More information and links to the data set can be found at http://www.fijicomp.org. 
Samples used in this study. Saliva samples were collected in 20% glycerol 
and frozen within 30 min of collection. Stool samples to be used for metagen- 
omic sequencing were collected within 30 min of voiding, stored in RNALater 
(QIAGEN) and frozen at 80°C. Stool samples used for retrieval of intact cells for 
single-cell analysis were collected within 30 min of voiding, stored in 20% glycerol 
and PBS, and frozen at —80°C. Seven individuals’ samples were selected from the 
FijiCOMP cohort for isolation of single-cell genomes (Supplementary Table 2). 
Samples destined for single-cell isolation were prepared using two different 
single-cell amplification approaches: one based on sorting individual cells and a 
second based on capturing individual cells within a hydrogel. 

Flow-sorted single-cell amplification. For the sorting method, thawed cells were 
resuspended in PBS-glycerol (20%) to a concentration of 10° cells per ml. Samples 
were serially filtered through 30-1] and 11-11 membranes then briefly sonicated 
(20s) and diluted 50-100-fold. They were sorted into individual wells of a 384-well 
plate containing 0.5 11 PBS. Single cells then underwent alkaline lysis for 15 min 
at room temperature, after which the solution was neutralized. Genomes were 
then amplified using multiple displacement amplification! for 16h at 30°C using 
high-purity ©29 polymerase expressed in Escherichia coli strain BL21_DE3. To 
initially identify amplified cells, the V68 region of the 16S rDNA locus was ampli- 
fied using 1:10 dilution of the MDA product and universal 16S primers 926F and 
1392R and then Sanger sequenced. Taxonomic classification was performed using 
the RDP naive Bayesian rRNA classifier (v. 2.6)°*. We chose cells from this group 
to sequence that maximized the likelihood of observing lateral transfer between 
cells by accounting for the previously observed rates of HGT between organisms 
of a specified phylogenetic distance (the per cent identity between pairs of 16S 
sequences were used as a proxy)°. 

MDA reactions were sonicated to shear large fragments of DNA. Samples then 
underwent end-repair using the NEB Quick Blunting Kit and were purified using 
a QIAGEN MinElute column. Illumina adapters containing 5-bp barcodes were 
ligated to the genomic amplicon using the NEB Quick Ligation Kit. The double- 
stranded adapters were modified using 5’ and 3’-amino modified bases at one 
end to prevent self-ligation. Size selection was then performed using SPRI beads. 
Libraries then underwent nick translation using NEB Bst polymerase. Libraries 
were each amplified using Phusion Polymerase (New England Biolabs). The final 
pooled library was then gel purified to retain fragments of sizes 250-600 bp and 
sequenced on an Illumina HiSeq. The average size of the final library constructs 
was 397 bp. The total sequencing depth was 310 million reads. Single-cell libraries 
were barcoded and sequenced. We took several precautions to ensure the fidelity 
of our final single-cell libraries. Barcodes used in library preparation were designed 
so that each barcode would require 2 base-pair shifts to mimic any other barcode 
in the study; that no indel would result in a different barcode; and that barcodes 
contained no more than two consecutive identical basepairs. Only reads containing 
exact barcode matches were incorporated into libraries. 

Hydrogel-based single-cell amplification. For the hydrogel method, 1011 of 
thawed cells were resuspended in 5001] PBST (0.1%). Samples were sonicated for 
20s and first filtered through a 35-j1m nylon mesh, followed by a 5-\1m membrane 
(Pall Corp.) with a 500-11 PBST wash. Samples were further diluted 500-2,000- 
fold in PBST to reach the final concentration of ~30 cells per jl. The diluted 
cell samples (2:1) then underwent alkaline lysis for 15 min at room tempera- 
ture, after which the solution was neutralized. Hydrogel monomer mix (1.3 mg 
4 Arm PEG Acrylate and 0.9mg SH-PEG-SH) and MDA master mix, including 
29 buffer (NEB), 50j1.M random hexamers with two phosphorothioate bonds 


at the 3’ termini, 2.5% DMSO, 0.4mM dNTP, 0.5 mg/mL BSA, 500nM SYTOX 
Orange (Invitrogen) and 1 jl REPLI-g sc Polymerase (Qiagen), were added to the 
side wall of the tube that contained the lysed cell sample. 2511 of each microbial 
concentration was added into a frame seal chamber and the sealed chamber was 
incubated at 30°C for 12h. Fluorescent DNA clusters were imaged and selected 
for hydrogel cluster retrieval. Approximately 0.2411 hydrogel and 10 pg DNA was 
captured in each cluster. This was dissolved and denatured in 1 j1l 400 mM KOH 
with 0.1 mM EDTA and 0.1 M DTT at 72°C for 10 min before neutralization. The 
neutralized product underwent a second MDA reaction (for 10 h) within a new 
hydrogel. The gel was then denatured and neutralized. 

MDA products were purified and quantified using the Quant-iT HS assay 
(Thermo Fisher Scientific) and normalized to 5 ng/\1l. Tagmentation reactions 
(Nextera) were carried out on 10 ng of purified DNA, and were followed by a SPRI 
cleanup. Unique PCR library barcoding using Index primers N7 and $5 (Illumina) 
was performed, followed with SPRI twice using equal volumes of beads to DNA. 
Libraries of were sequenced to a depth of ~1.5M 125-bp paired-end reads on 
Illumina’s Hiseq 2500. The Broad Institute’s Internal Genomics Platform’s custom 
designed paired-end library barcodes were used. Final library size was 200-300 bp. 
Assembly of single-cell genomes. Single genome amplicons were quality filtered 
(Phred score > 3), and filtered for reads that were less than 45 bp and for those that 
aligned with the human genome, the P aeruginosa PAO] genome (a laboratory 
contaminant) and the E. coli BL21_DE3 genome (from which the ©29 polymerase 
used in the MDA reaction was expressed and purified) using BMTagger. A small 
number of adaptor sequences were found in the raw data due to small inserts or 
primer-dimers. These adaptor sequences (30-61 bp in length) were easily identified 
using BLASTn and were removed before analysis. Amplicons were then assembled 
into genomes using either CLC Assembly Cell (v. 4.2), for the flow-sorted cells, 
or SPADES* (with the-careful flag) (v.3.6.0) for the hydrogel-captured cells. We 
retained assembled contigs that were at least 500 bp and resultant genomes where 
at least 100 kb could be assembled. 

Filtering single-cell assemblies. To further vet the quality and purity of our 
assemblies, we used BLASTp to assign taxonomies to a set of 31 predetermined 
core genes that are both phylogenetically conserved and single copy in almost all 
genomes™. Although we could not identify the full set of 31 core genes in any of 
the assemblies, we removed several cells in which the core genes reflected mixed 
taxonomies. Additional validations of the single cell assemblies included quanti- 
fying the levels of contamination using CheckM®. We retained cells with less than 
10% putative contamination. We used RNAMMER* to identify 16S sequences 
present in the assembled genome. We discarded a small number of cells that had 
multiple 16S sequences or those in which the RNAMMER- identified 16S rRNA 
sequence conflicted with the Sanger-sequenced 16S rRNA V68 region. Our final 
data set included 196 single cell assemblies (Supplementary Table 4). 

Identification of horizontally transferred genes between divergent genomes. 
To identify horizontally transferred regions, we used a previously benchmarked 
method*®. All assembled and reference genomes were compared in a pairwise man- 
ner using default in BLAST-+ (v. 2.2.24). Recent DNA transfers are defined as the 
presence of near-identical DNA fragments (99% per cent identity or greater) with 
a length of >500 bp in two distantly related genomes. Note that the identification 
of identical DNA does not imply direct transfer between two cells. We found that 
approximately two-thirds of the identified transferred regions contained at least 
one SNP, suggesting that they could not have been the result of contamination. We 
restricted our analysis to comparisons between genomes whose full length 16S dis- 
tance would be at least 3% dissimilar. This cut-off allowed us to distinguish between 
signatures of vertical inheritance and horizontal transfer, as 97% divergence at the 
16S corresponds with roughly 75 million years of evolution, during which time well 
over 100 mutations would have accumulated per 500 bp. By conservatively defining 
horizontally transferred regions as those with such high nucleotide identity, we are 
likely to miss larger horizontally transferred regions that have undergone homolo- 
gous recombination and rearrangement"®. For the HMP reference genomes, most of 
which are draft status, the 16S rRNA sequences were identified using RNAMMER. 
We included only those HMP reference genomes that had less than 10% contam- 
ination as identified by CheckM and in which the annotated 16S region spanned 
a minimum of 200 bp of the V68 region (this excluded 79 genomes), resulting 
in a final set of 387 genomes (Supplementary Table 3). In 78 of the FijiCOMP 
single-cell assembles, RNAMMER identified full length 16S sequences. Multiple 
sequence alignments were performed using the RDP (v.11) Infernal aligner 
(http://rdp.cme.msu.edu/)**°, which accounts for secondary structure in the 
16S. Per cent identity was calculated either on the full length 16S across sites 
that were present in 95% of cells or in a trimmed alignment of the V68 region, 
accounting for sites that were conserved in at least 95% of cells. By comparing per 
cent identity cut-offs of the hypervariable V68 region with the full-length 16S gene, 
we determined that a 95% identity cut-off within the V68 region was an equivalent 
cut-off to 97% identity between full-length 16S genes (Extended Data Fig. 8a). 
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Cells that have unusual or multiple divergent 16S sequences can cause highly con- 
served (e.g., ribosomal) genes to appear horizontally acquired. Although some 
ribosomal genes may be transferred, especially across related species, because they 
confer antibiotic resistance, to minimize the contribution of closely related strains, 
we excluded all HGT events inferred between a pair of cells for which any riboso- 
mal gene was inferred to be transferred. HGT was observed to occur less frequently 
between cell pairs as the phylogenetic distance between them increased (Extended 
Data Fig. 8b), as previously reported’. 

To illustrate the diversity of genomes used in our study and their sources, a 

maximum likelihood-based phylogenetic tree was constructed with FastTree2 
(v.2.1.3)°” (GTR nucleotide substitution model) using a multiple sequence align- 
ment generated by RDP of the full length 16S sequences where available and the 
16S V68 region those were full length sequences could not be identified. 70 of 
the hydrogel-captured single-cell genomes lacked 16S rRNA sequences and were 
therefore not included in this tree. The tree was rooted using Archeal taxa as the 
outgroup, although placement of the root within Bacteria is unsupported. 
Creating a non-redundant mobile gene set. In order to align reads to genes to 
attain relative gene abundances, we built a non-redundant gene data set (Extended 
Data Fig. 2). After pair-wise BLASTs between genomes (Step1), we clustered hori- 
zontally transferred regions using single-linkage clustering (Step2), identifying 
all with partial or full overlapping regions. Next, we identified ORFs within each 
contig using Prodigal (v.2.5)*8 (Step3). We included ORFs that overlapped at a 
minimum of 50% with a horizontally transferred region. This cut-off avoids false 
positive genes that overlap minimally with horizontally transferred regions, but 
allows for the inclusion of genes that may have been truncated due to highly frag- 
mented draft genomes. We then clustered ORFs from each group of overlapping 
transferred regions into non-redundant sets using UCLUST® (v.1.5.579), with 
a 90% identity cut-off (Step4). The vast majority of genes were within 99-100% 
identity with the gene chosen for read alignment (that is, the centroid) (Extended 
Data Fig. 9). No gene with more than 10% ambiguous base pairs was chosen for 
read alignment. Since non-overlapping horizontally transferred regions may con- 
tain identical genes, we performed an additional final BLASTn search between 
genes to further reduce redundancy, although this step resulted in the removal of 
a relatively small number of genes (Step5). 
Functional annotation of horizontally transferred genes. We relied on several 
methods for functional annotation. Details on the functional annotation of each 
gene are provided in Supplementary Table 5. We queried sequences using BLASTp 
against the Kyoto Encyclopedia of Genes and Genomes (KEGG)”, using BLASTp 
to attain their KO classifications (e value 10°, per cent identity 30%); against the 
Clusters of Orthologous Groups (COG) database using rpsBLAST (e value 10~>, 
per cent identity = 30%); and the TIGRFAM (v12.0) and PFAM (v.26) databases 
using HMMER (v.3.0) (e value 10-4, score 22). KO numbers were mapped directly 
back to COG; and TIGRFAM roles that could not be assigned COGs retained 
the TIGRFAM designation. For each gene cluster, functional annotations were 
aggregated by retaining annotations to any gene within the cluster first according 
to COG classification, followed by KEGG, TIGRFAM and finally PEAM. We also 
assigned metabolic functions using the Automated Carbohydrate-Active Enzyme 
Annotation (http://csbl.bmb.uga.edu/dbCAN/)"" (v.4.0), which employs an HMM- 
based search protocol. These were then assigned substrate categories”’. To annotate 
antibiotics resistance genes, we used the ResFam” core genes database, which uses 
HMMs. Still, 32.8% of our HGT gene set could not be annotated by any of these 
means. To annotate the type of mobile genetic element specifically, we performed 
a keyword search amid the gene descriptions as follows: 

Transposon: transpos*; insertion; resolv*; Tra[A-Z]; Tra[0-9]; IS[0-9]; “con- 
jugate transposon” 

Plasmid: resolv*; relax*; conjug*; trb; mob*; plasmid; “type IV”; toxin; “chro- 
mosome partitioning”; “chromosome segregation” 

Phage: capsid; phage; tail; head; “tape measure”; antitermination 

Other HGT machinery: integrase; excision*; exonuclease; recomb; toxin; 
CRISPR; restrict*; resolv*; topoisomerase; “reverse transcrip” 

Carbohydrate-active enzymes and related proteins: Genes present in the CAZY 
database; glycosyltransferase; “glycoside hydrolase; xylan; monooxygenase; rham- 
nos*; cellulose; sialidase; “ose; acetylglucosaminidase; cellobiose; galact*; fructose; 
aldose; starch; mannose; mannan*; glucan; lyase; glycosyltransferase; glycosidase; 
pectin; SusD; SusC; fructokinase; galacto*; arabino*; 

Proteins conferring antibiotic resistance: Genes present in the ARDB; multi- 
drug; “azole resistance”; antibiotic resistance”; TetR; “; “tetracycline resistance”; 
VanZ; betalactam*; beta-lactam; antimicrob*; lantibio* 

DNA extraction of FijiCOMP samples and sequencing. DNA from saliva samples 
was extracted using the Promega Maxwell Buccal Swab LEV DNA Purification 
Kit. Stool samples were collected within 24h of saliva sample collection. DNA 
from stool samples was extracted using the MoBio Laboratories PowerSoil 96 Well 
DNA Isolation Kit with an added proteinase K step to aid in lysis (100 j1g/ml final 
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concentration at 55°C for 20 min). Metagenomic samples were barcoded, multi- 
plexed and sequenced across several lanes on the Illumina HiSeq 2000 platform 
(101 bp paired-end reads), in excess of 8 Gb of sequence per stool sample and 
10 Gb per saliva sample. 

Alignments of metagenomic data. The metagenomic samples analysed include 81 
stool samples from the Human Microbiome Project (Supplementary Table 1) and 
172 stool samples and 134 saliva samples from the FijiCOMP study (Supplementary 
Table 2). Raw metagenomic sequences from the FijiCOMP study were preproc- 
essed in a manner equivalent to that for the HMP metagenomic samples!°: they 
were initially filtered for quality (Phred score > 3), minimum length (60 bp), and 
minimizing reads originating in human genomes, using BMTagger (ftp://ftp.ncbi. 
nlm.nih.gov/pub/agarwala/bmtagger/ NCBI/NLM/NIH; 07 March 2011, version 
3.101). Sequencing depths of the final sample set were comparable between the 
FijiCOMP and HMP cohorts. 

Reads were aligned to the non-redundant set of mobile genes (described above) 
using the BWA (v.0.7.12-r1039)*? mem algorithm. A read pair is considered 
mapping when at least one read of the pair is mapped to the reference gene with 99% 
or greater sequence identity and aligned with at least half of the read length so as to 
avoid edge effects. In the vast majority of reads, the entirety of the read aligned with 
99% or greater identity (Extended Data Fig. 10). After alignment, we considered 
only those genes which were had at least 80% coverage across the length of the gene, 
and had a median alignment depth of at least 4 reads per bp. In order to compare 
between samples that had varying read depth and across genes of varying length, 
we used a measure of fragments per kilobase of protein coding sequence per million 
mapped reads (FPKM), where each fragment is defined as a read pair. Alignments 
of the stool samples were observed for 88.3% of the 10,461 identified unique genes. 

We did not consider minimum or maximum gene lengths for inclusion in our 
data set. The minimum gene length observed is 60 bp and the median length for 
all of the genes is 654 bp. Although genes of different lengths may be expected to 
be able to recruit metagenomic reads and meet the gene coverage criteria, gene 
length was not significantly correlated with prevalence, median or maximum 
abundance. 

Some of the mobile genes are likely duplicated trans-acting core genes that con- 
fer antibiotic resistance, such tRNA synthetases***”, FPKM values for these genes 
will be artificially inflated due to the false-positive recruitment of reads matching 
core genes. We presume that this component is small and given that they may be 
able to recruit reads from various genomes, their distribution across populations 
may serve to minimize differences between populations. 

Assessing GH13 prevalence and abundance in entire metagenome. To determine 
whether any GH13-family genes, including those not identified here as mobile, 
were enriched in the FijiCOMP cohort, we aligned translated reads from each 
of the FijiCOMP and HMP metagenomic samples using PAUDA (v.1.0.1) toa 
database containing 16,244 GH13-family protein sequences annotated as such 
by the Carbohydrate-Active enZYmes Database (http://www.cazy.org) as well 
as 48 GH13-family protein sequences identified as mobile using the FijiCOMP 
single-cell and HMP assemblies. This database amounted to 11,236 unique 
protein-coding genes, of which 38 were identified as horizontally transferred. We 
then calculated coverage RPKMs (reads per kilobase of protein coding sequence 
per million mapped reads) for each of the genes in each of the samples and analysed 
3,770 unique proteins with RPKM values greater than 100 RPKM. The distri- 
bution of GH13 genes differed between populations (P < 10-°, PERMANOVA 
test of Jensen-Shannon divergences, 10‘ permutations). 351 and 63 of the GH13 
family genes were more prevalent in the FijiCOMP and HMP cohorts, respectively 
(q < 0.001 after FDR correction for Fisher's exact test). 347 genes (including 29 
of the GH13 horizontally transferred genes) were enriched (q < 0.001 after FDR 
correction of Mann-Whitney tests) in the FijiCOMP data set, compared with only 
118 (including 2 of the GH13 horizontally transferred genes) enriched in the HMP 
data set. We cannot conclusively determine whether the GH13 genes not identified 
in this study as horizontally transferred are part of the core or flexible genome. 
Vector contamination. Cloning vectors may have been used in the sequencing 
of HMP reference and draft assemblies. (Cloning vectors were not used in the 
sequencing of the FijiCOMP single-cell genomes and adapters were removed 
before assembly.) As cloning vector sequence is sometimes retained in the assembly, 
these assemblies were screened for vector contamination against the UniVec data- 
base (http://www.ncbi.nlm.nih.gov/tools/vecscreen/univec/), using suggested 
parameters. Putative contamination was observed in all 387 of the HMP reference 
genomes, although only 102 cells had regions of putative contamination of at 
least 500 bp, which would have affected the method used to identify HGT events 
(Supplementary Table 3). The metagenomic libraries were not created using 
cloning vectors and therefore it may be possible to distinguish between those 
that are contamination (that is, those that failed to recruit reads) and those that 
are present. Only 102 genes linked to possible vector contamination failed to 
recruit reads. 
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tRNA analysis. Metagenomic alignments were also used to determine which 
mobile genes were adjacent to tRNA genes. We could then examine mobile 
elements that had integrated next to tRNAs to determine the taxonomy of the 
bacterial host. tRNAs have stable phylogenies that can be used for phylotyping”’. 
Metagenomic alignments were filtered for paired reads that had only one read- 
pair aligning to a horizontally transferred gene. tRNA genes were identified 
using the program ARAGORN”. Taxonomies of the tRNA genes were ascer- 
tained through BLASTn searches against NT, requiring an alignment of 85 nucle- 
otides at 85% identity. We identified 838 mobile genes adjacent to tRNA genes. 
Of these, 194 had multiple bacterial hosts, multiple isoacceptors, or multiple 
tRNA genes. 394 of the genes adjacent to tRNA genes had at least one bacterial 
host that could be taxonomically identified. Plots reflect the number of reads 
observed. 72.7% of genes adjacent to tRNAs that had annotatable functions were 
genes involved in the process of horizontal gene transfer (phage, transposon, 
plasmid, etc.). 

Defining high confidence mobile genes. We also performed our analysis on the 
mobile gene pools in Fijian and American metagenomes on a subset of higher 
confidence mobile genes representing known HGT machinery genes and/or genes 
on scaffolds that have additional independent support for their phylogenetic 
placement. BLASTn searches were performed for each of the 1,662 FijiCOMP 
single-cell contigs containing a mobile gene against closely related organisms 
where available, resulting in 634 contigs that had additional phylogenetic support 
that could be compared. Additionally, phylogenetic placement was attained by 
examining tRNA genes on paired-reads aligning to the mobile gene set. This 
higher confidence set includes 6,187 unique genes (59.1% of the total mobile 
gene data set). Even with this smaller subset of high confidence genes, distinct 
functional gene pools can be observed between the HMP and FijiCOMP cohorts 
(Extended Data Figs 3, 4). 

Statistical analysis of gene/function presence and abundance across samples. 
FPKMs associated with the same protein family were summed before analysis. To 
assess which genes were enriched in each population, the FPKMs of each gene were 
compared according to country/village using FDR-adjusted Mann-Whitney tests. 
Comparisons of prevalence were tested using Fisher’s exact tests. All analyses were 
performed in R (v.3.1.0). 

Identifying linkage between pairs of horizontally transferred genes. 
Metagenomic alignments were filtered for paired reads that matched two distinct 
genes in the mobile gene set. The HMP and FijiCOMP metagenomic libraries 
were both made with an average insert size of 180 bp. We required both genes to 
have 99% sequence identity or greater to their respective genes. We retained only 
paired reads that had information on alignment and orientation. Analysis of gene 
linkages was performed using the igraph package (v.0.7.1) in R. Multiple contexts, 
as determined by the linkage between mobile genes in the data set, were observed 
for 5,484 out of a total of 9,233 genes.. 

Microbiome composition. The bacterial composition of all samples was deter- 
mined using MetaPhylerSR (v.0.115)*". The default database was supplemented 
with AMPHORA genes from the genome Succinatimonas sp. CAG:777 genome. 
The prevalence and abundance of plant matter was determined after aligning 
metagenomics reads to the SILVA database (v.115)>!. We used FDR-adjusted 
P values of Pearson correlations to determine whether the abundances of any 
genes were significantly associated with the abundances of specific organ- 
isms in each cohort. We found that the abundances of 1,800 out of 8,322 
genes (21.6%) and 1291 out of 6669 (19.4%) genes significantly correlated 
with the abundances of a single organism in the FijiCOMP and HMP cohorts, 
respectively. 

Fijian Diet. The Fijian diet is rich in high-starch foods***, including taro 
(Colocasia esculenta), giant swamp taro (Cyrtosperma chamissonis), giant taro 
(Alocasia macrorhhiza), tannia (Xanthosoma sagittifolium), cassava (Manihot 
esculenta), sweet potato (Ipomoea batatas), yams (Dioscorea spp.), breadfruit 
(Artocarpus Altilis) and plantains (Musa cultivars). 

Fijians rely heavily on seafood as a main source of protein. Artisinal fishing 
is predominant in the study regions®*. The species that are commonly fished 
locally by artisanal fishers are: eastern triangular butterflyfish (Chaetodon baron- 
essa), swallowtail puller (Chromis ternatensis), blue-green damselfish (Chromis 
viridis), threadfin breams (Pentapodus sp.), imitator damselfish (Pomacentrus 
imitator), and two-lined monocle bream (Scolopsis bilineata). 27 species of sea 
cucumbers are commonly fished, though much of this catch is sold interna- 
tionally, including those caught by artisanal fishers. The species that are com- 
monly fished commercially, although consumed less often in the study region, 
are: blue finned rock cod (Cephalopholis microprion), blue-spotted grouper 
(Cephalopholis rodeta), daisy parrotfish (Chlorurus sordidus), orange-spotted 
emperorfish (Lethrinus erythracanthus), thumbprint emperor (Lethrinus harak), 
and snappers (Macolor spp.)*°. 
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Additional fish and shellfish consumed in Fiji, although not assessed for the 
study region include: albacore tuna (Thunnus alalunga), yellowfin tuna (Thunnus 
albacares), skipjack tuna (Katsuwanas pelamis), bigeye tuna (Thunnus obesus), 
Spanish mackerel (Scomberomorus commerson), striped marlin (Tetrapturus 
audax), blue marlin (Makaira mazara), barracuda (Sphyraena sp.), swordfish 
(Xiphias gladius), sailfish (Istiophorus platypterus), opah (Lamprius regius), 
sunfish (Mola mola), mahi mahi (Coryphaena sp.), black snapper (Macolor niger), 
goatfish (Parupeneus barberinus), parrotfish (Scarus sp.), rabbit fish (Siganus pun- 
tatus), peacock cod (Cephalopholis argus), unicornfish (Naso unicornis), cockles 
(Anadara antiquata), freshwater mussels (Batissa violacea) and other unspecified 
reef fish*”. 

Antibiotic Use in Fiji. Specifically, we expected that orally administered 
antibiotics used to treat common ailments and dental infections would be 
the primary target of resistance, and these are limited to beta-lactams (amox- 
icillin, penicillin, fluxocillin), tetracycline, chloramphenicol, quinolones 
(ciprofloxacin), and metronidazole**™. Thus, differences may reflect antibi- 
otic use in other societal sectors, such as in livestock, historical uses of antibiot- 
ics, or the acquisition of multiple antibiotic resistance genes transferred within 
single cassettes. These findings highlight our relatively limited understanding 
of the forces that drive selection for antibiotic resistance within populations, 
and the reservoirs for resistance genes®!. These topics are acutely important in 
Fiji, as Shigella infections are common and resistance to betalactamases, tetra- 
cyclines, chloramphenicol, cephalosporins and quinolones has already been 
reported™. 

Data availability. FijiCOMP metagenomic samples generated for this study are all 
deposited on NCBI’s Short Read Archive under Project Number PRJNA217052. 
The specific SRS numbers corresponding to each study participant’s gut and oral 
microbiome can be found in Supplementary Table 2. The single-cell assemblies 
can be found in the Data section at http://www.fijicomp.org. 
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Extended Data Figure 1 | Phylogeny of assemblies used in the study 
span the bacterial Tree of Life. A phylogenetic tree constructed using a 
multiple sequence alignment of the full 16S rRNA gene or the V68 region 
of the 16S rRNA gene of all reference genomes and single-cell assemblies 
used in this analysis where available. 16S alignments were constructed 
using RDP. The tree was then assembled using FastTree. Support was 

low for all deep branches in the tree, so the archeal branch serves as the 


outgroup for illustrative purposes only. The outer colour bar displays 
taxonomic associations for archea and bacterial phyla. The inner colour 
bar displays the source of that operational taxonomic unit: HMP reference 
cells (n = 387 blue) and FijiCOMP single cell assemblies (n = 110, red). 
16S rRNA gene sequences were not available for 70 FijiCOMP single-cell 
assembles, which are therefore not included in this tree. 
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We used BLAST to identify identical or near identical (>99%) genes d f — 
across our dataset. This step affects a small number of genes, but 
further helps to reduce redundancy in the final dataset. 
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B Align metagenomic reads to mobile gene dataset 


Only include reads that align with 99% identity over 
at least 50% of the read length. 


Only count gene alignments with 80% coverage over their 
length and at least 4x coverage. 
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Extended Data Figure 2 | Methodology for identifying horizontally 
transferred genes and assessing their distribution within the 
metagenomic samples. Horizontally transferred regions were first 
identified using pair-wise BLASTs between HMP reference genomes and 
FijiCOMP single cell assemblies. Open reading frames were annotated 
within the horizontally transferred regions. Genetic redundancy was 
removed in the mobile gene set to ensure accurate abundance estimates 
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using a combination of UCLUST and BLAST. Metagenomic reads were 
then aligned to the data set of unique mobile genes. Alignments were 
filtered to retain only reads that aligned with 99% identity across over 50% 
of their read length. Abundances of genes in the metagenomic samples 
were determined for genes whose alignments had a minimum of 4 x 
alignment depth over 80% of the gene length. 
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Extended Data Figure 3 | The abundance of mobile gene families 

is largely determine by cohort. a, A heat map is plotted showing the 
abundances (FPKM) of mobile genes aggregated by functional gene 
family (COG assignment, KEGG, TIGREAM or PFAM family) within 
each of the metagenomic samples (81 HMP samples and 172 FijiCOMP 
samples). Hierarchical clustering using complete linkage was performed 
on the Euclidean distances between profiles of functional gene families 
across individuals and on the distances between individuals’ mobile 
gene composition. Values are plotted on a logarithmic scale. b, A heat 
map is plotted showing the abundances (FPKM) of only those mobile 
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gene families that were deemed of higher confidence within each of the 
metagenomic samples. These include mobile gene families from mobile 
genes that were annotated as horizontal transfer machinery or had 
additional support for their phylogenetic placement. The placements of 
gene families and individuals were maintained from a for comparative 
purposes. c, A heat map is plotted showing the abundances (FPKM) of 
only those mobile genes that were observed to be transferred between 
HMP reference genomes within each of the metagenomic samples. The 
placements of gene families and individuals were maintained from a for 
comparative purposes. 
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Extended Data Figure 4 | Distributions of GH13 genes and glycoside 
hydrolase families within mobile genes of higher confidence display 
population-specific enrichment. a, Prevalence and abundance (measured 
by FPKM) of mobile genes annotated as members of the GH13 family in 
the FijiCOMP (n= 172, red) and HMP (n= 81, blue) metagenomic stool 
samples. b, Prevalence and abundance of all glycoside hydrolase families 
within the higher confidence mobile gene subset present in the FijiCOMP 


(red) and HMP (blue) metagenomic stool samples. Only unique gene 
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families from mobile genes that were annotated as horizontal transfer 
machinery or had additional support for their phylogenetic placement 


of their substrates. 
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are included here. Abundances were measured by FPKM, aggregated 
according to glycoside hydrolase family, and plotted as a function of the 
density across samples. For each glycoside hydrolase family, the number of 
unique horizontally transferred genes observed is noted, as are the sources 
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Extended Data Figure 5 | Composition of the gut microbiomes of HMP 
and FijiCOMP study participants. a, Relative abundances of bacteria 
according to phylum plotted for metagenomic samples from individuals in 
the HMP (n=81, blue) and FijiCOMP (n =172, red) cohorts. Samples are 
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sorted according to cohort and the abundance of the dominant phyla. 
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b, Relative abundances of families within the order Bacteriodales plotted 
for metagenomic samples from individuals in the HMP (blue) and 

FijiCOMP (red) cohorts. Samples are sorted according to cohort and the 
abundance of the top Bacteroidales member. 
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Extended Data Figure 6 | Mobile genes are observed in a wide variety within the heat map reflect the read abundance according to the species 
of bacterial host backgrounds across the two cohorts. a, b, A heat map associated with the specific tRNA gene. The colour bar shows which 
is plotted showing the number of read-pairs per person that aligned to meteganomic cohort the reads are from: FijiCOMP (red) and HMP (blue). 
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Extended Data Figure 7 | The relative abundances of genes and contexts across populations is not sensitive to precise definitions. Percentages of 
gene families, as determined by COG annotations (left), identical genes (middle) and gene contexts (right) between populations for a wide range of 
parameters. Bars are plotted in 5% increments. Bars shaded in black are the parameters that are plotted in Fig. 4. 
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Extended Data Figure 8 | Horizontal transfer varies across cells at 
different phylogenetic distances. a, Nucleotide identity cut-offs for 

full length 16S rRNA and the V68 16S rRNA region were compared to 
avoid comparisons between closely related cells. For each pair of HMP 
reference genomes, nucleotide identity for their full-length 16S rRNA is 
plotted against that of their V68 regions. 97% identity of full-length 16S 
(corresponding to approximately 75 million years of evolution) was used 
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comparisons contributing to each of the lines. c, HGT frequency plotted as 
a function of the phylogenetic divergence between species between all cell- 
cell comparisons (black), between HMP reference genomes only (blue) 
and between the FijiCOMP single cell assemblies (red). This plot includes 
only cells for which full-length 16S rRNA genes could be identified. 
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and reduce the redundancy of the gene set. This step is essential for 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


> 


w» 300 million 
T 
oO 
v 
6 
o 
2 
E 
=) 
Zz 
0 — et 
300 million 
o 
ise) 
v 
ue 
) 
o 
we) 
E 
2 
0 —— ee ee ee ee ee ee 
I I I I I I 
50 60 70 80 90 100 
Percent identity 
over the entire read length 
B 
500 million 
o 
ise) 
v 
ue 
) 
a 
we) 
E 
=) 
a ee! 
0 
» 300 million 
T 
oO 
v 
re) 
a 
2 
€ 
=) 
Zz 
0 ——— ee ee 


50 60 70 80 90 100 


Percent of the read that aligns 


Wi FijiCOMP metagenomes 
Mi HMP metagenomes 


Extended Data Figure 10 | Metagenomic reads align to mobile genes with high fidelity over their entire length. Metagenomic reads were required 
to align with 99% identity to a mobile gene over at least 50% of the read length. Despite the seemingly low 50% cut-off, almost all reads align with 
near-perfect nucleotide identity over the entire length of the gene. 
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Glial-cell-derived neuroregulators control type 3 
innate lymphoid cells and gut defence 


Sales Ibiza!*, Bethania Garcia-Cassani!*, Hélder Ribeiro!, Tania Carvalho!, Luis Almeida!, Rute Marques?+, 
Ana M. Misic*+, Casey Bartow-McKenney’, Denise M. Larson‘, William J. Pavan‘, Gérard Eberl’, 


Elizabeth A. Grice? & Henrique Veiga- Fernandes!” 


Group 3 innate lymphoid cells (ILC3) are major regulators of 
inflammation and infection at mucosal barriers’. ILC3 development 
is thought to be programmed’, but how ILC3 perceive, integrate 
and respond to local environmental signals remains unclear. 
Here we show that ILC3 in mice sense their environment and 
control gut defence as part of a glial-ILC3-epithelial cell unit 
orchestrated by neurotrophic factors. We found that enteric ILC3 
express the neuroregulatory receptor RET. ILC3-autonomous Ret 
ablation led to decreased innate interleukin-22 (IL-22), impaired 
epithelial reactivity, dysbiosis and increased susceptibility to 
bowel inflammation and infection. Neurotrophic factors directly 
controlled innate [122 downstream of the p38 MAPK/ERK-AKT 
cascade and STAT3 activation. Notably, ILC3 were adjacent to 
neurotrophic-factor-expressing glial cells that exhibited stellate- 
shaped projections into ILC3 aggregates. Glial cells sensed 
microenvironmental cues in a MYD88-dependent manner to control 
neurotrophic factors and innate IL-22. Accordingly, glial-intrinsic 
Myd88 deletion led to impaired production of ILC3-derived IL-22 
and a pronounced propensity towards gut inflammation and 
infection. Our work sheds light on a novel multi-tissue defence 
unit, revealing that glial cells are central hubs of neuron and innate 
immune regulation by neurotrophic factor signals. 

ILC3 produce proinflammatory cytokines, and regulate mucosal 
homeostasis and anti-microbial defence’. In addition to their well- 
established developmentally regulated program, ILC3 are also 
controlled by microbial and dietary signals'~®, suggesting that 
ILC3 possess other unexpected environment-sensing strategies. 
Neurotrophic factors are extracellular environmental cues to neurons 
and include the glial-derived neurotrophic factor (GDNF) family 
ligands (GFL) that activate the tyrosine kinase receptor RET in the 
nervous system, kidney and haematopoietic progenitors” !!. 

Analysis of gut lamina propria revealed that ILC3 express high 
levels of Ret”'* (Fig.1a), a finding confirmed at the protein level and 
by reporter Ret©* knock-in mice! (Fig. 1b-d and Extended Data 
Fig. la-d). ILC3 subsets expressed Ret°'? and aggregated in cryp- 
topatches and isolated lymphoid follicles (ILF), suggesting a role of 
neuroregulators in ILC3 (Fig.1b-d and Extended Data Fig. 1b-j). 
To explore this hypothesis, we transplanted fetal liver cells from 
Ret competent (Ret "/G"?) or deficient (Ret©*?/G*?)? animals into 
alymphoid Ragl~/~yc/~ hosts. Ret-deficient chimaeras revealed 
unperturbed ILC3 and cryptopatch development (Fig. le). Notably, 
IL-22-expressing ILC3 were largely reduced despite normal IL-22 pro- 
duction by T cells (Fig.1f, g). In contrast, innate IL-17 was unaffected 
by Ret ablation (Fig.1f and Extended Data Fig. 2a). In agreement with 
this, analysis of gain-of-function RetM=N?8 mice! revealed a selective 


a Ret b ILC3 
§ 0.9) ILC3 
D Te 0 67.9 2.2 65.7 31.4 L 36.5 54.7, 9.6 
a & i : 
aon G : 
od ii : Fy 
27 i ints sie sie afefeat 
zs (1) 32.1] i3.9 28.2 123.3 8.8] 114.7 21 
c 0 : 
SES TBDe MONG RORyt IL-7Ra CCR6 NKp46 
ROXS) 


ir) 


Retlyverr, mILF 


GFP Chi 
e Ret¢FP chimaeras f ILC3 ¢ is 
a 30 _ 25, 
WT/GEP. GFP/GEP. WT/GFP GFP/GFP => a 
Ret! pe Ret' . Ret' Ret' gs 8 lt 
775) & me 
a 2S 
5 
A [o} 
g ea 
i? Ss 
< =0 Z 0. 
B Ieee NS NS 
S S NS 7 ate 
N S & 
50) — 80) — 2 2) 3 & ors il 
8 iL 3a 3 |L7 se 
= T = 3 a mILF 9.46 697) 3 Ox 
= we 5e Dir & 26 
g gx ye Mcp ui §= 
“TIES £ an 
0 2 20 —. IL 17 —— 0 Ret T/GFP m RetGFP/GFP 
ORetV/GFP gm RetGFP/GFP — RetWT/GFP RetGFP/GFP 
g ___Teells LO3 * = 
25 . 15 
| RetWT/GFP RetGFP/GFP Ret™T RetMEN2B 7 x _ 
< 3 =< 
fey Sk 
® iN 5B 
N 20 
| Ss E=|= 
EF 
ae ~  g ~ ol 2 9 
= — IL-22 — 
~12, NS. 30, NS p 18 NS. 10, NS 
x & = oO RetWT/GFP = & 
a ag I mi RetSFP/crP 6.67 9.87 = as 
3 ie 3 || Lee 
8 6S =| 3% 
. 53 eso |= Bg 
5 : 5 
+ so iam KR 20 
q EL 7 eal 
ai 5 aah 5 
=> Z0 en a) 24 
—=— ILA -— IRet"T — m RetMEN28 


Figure 1 | The neurotrophic factor receptor RET drives enteric ILC3- 
derived IL-22. a, Relative expression of Ret in lymphoid tissue initiator 
cells (LTi), natural cytotoxicity receptor (NCR)~ and NCR* ILC3 subsets, 
T cells (T), B cells (B), dendritic cells (Dc), macrophages (Mo), enteric 
neurons (N) and mucosal glial cells (G). b, Ret!” ILC3 subsets in the 
gut. c, Left, RetC"? gut. Right, ILC3 aggregates. White, GFP. d, Enteric 
cryptopatches (CP), immature (iILF) and mature (mILF) isolated 
lymphoid follicles. Green, RET/GFP; blue, ROR4t; red, B220. e, Small 
intestine of Ret©*? chimaeras (n= 15). f, g, Enteric ILC3 and T cells from 
RetSF? chimaeras. RetW!/GFP, n = 25; RetGFP/GFP. 4 — 22. h, Enteric ILC3 
from RetM=N?8 mice (n =7). Scale bars, 1 mm (c left, e); 50 yum (c right); 
301m (d). Data are representative of 4 independent experiments. Error 
bars show s.e.m. *P < 0.05, **P < 0.01; NS, not significant. 
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increase of IL-22-producing ILC3, whereas their IL-17-producing 
counterparts were unaffected (Fig.1h and Extended Data Fig. 2b). To 
more specifically evaluate the effects of RET in ILC3, we deleted Ret 
in ROR}t-expressing cells by breeding Rorgt-Cre with Ret!" mice!>!6 
(Extended Data Fig. 3a, b). Analysis of Rorgt-CreRet/! (Ret) mice 
revealed selective and extensive reduction of ILC3-derived IL-22, but 
normal IL-22-producing T cells (Fig. 2a and Extended Data Fig. 3c, d). 
IL-22 acts on epithelial cells to induce reactivity and repair genes’. 
When compared to their wild-type littermate controls, the Ret® 
epithelium revealed normal morphology, proliferation and paracel- 
lular permeability, but a marked reduction of epithelial reactivity and 
repair genes (Fig. 2b and Extended Data Fig. 3e-h). Accordingly, the 
RetM"N28 epithelium displayed increased levels of these molecules in 
an IL-22-dependent manner (Fig. 2b and Extended Data Fig. 3i). These 
results indicate that RET signals selectively control innate IL-22 and 
shape intestinal epithelial reactivity. 

To determine whether neurotrophic factors regulate intestinal 
defence, we tested how varying degrees of RET signals control enteric 
aggressions. Whereas Ret“ mice treated with dextran sodium sulfate 
(DSS) had increased weight loss and inflammation, reduced IL-22- 
producing ILC3, decreased epithelial reactivity/repair genes and 
pronounced bacterial translocation from the gut, RetMEN28 mutants 
were highly protected compared to their wild-type littermate controls 
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Figure 2 | ILC3-intrinsic RET signals regulate gut defence. a, ILC3- 
derived cytokines (n = 11). b, Gene expression in the epithelium of Ret* 
and RetM=N?8 mice compared to their wild-type littermate controls (n =7). 
c-f, DSS treatment in Ret and Ret“ mice (n=8). c, Histopathology. 

d, Inflammation score and colon length. e, Innate IL-22. f, Bacterial 
translocation. CFU, colony-forming unit. g-j, DSS treatment in Ret“? 
and RetMEN?B mice (n=8). g, Histopathology. h, Inflammation score and 
colon length. i, Innate IL-22. j, Bacterial translocation. mLN, mesenteric 
lymph node. k-n, C. rodentium infection in Rag1~/~ Ret" (n=15) and 
Ragi~'~ Ret* (n= 17) mice. k, Histopathology. 1, Inflammation score and 
colon length. m, Innate IL-22. n, Infection burden. Scale bars, 200 jim. 
Data are representative of 4 independent experiments. Error bars show 
sem, *P< 0.05, ** P< 0.01. 
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(Fig. 2c-j and Extended Data Fig. 4). As DSS largely causes epithelial 
injury, we tested whether ILC3-autonomous RET signals are required to 
control infection. To this end, Ret mice were bred with Rag1~'~ mice 
to formally exclude adaptive T-cell effects. The resulting Rag]~/~Ret* 
mice were infected with the attaching and effacing bacteria Citrobacter 
rodentium. When compared to their littermate controls, Ragl ~'-RetS 
mice had marked gut inflammation, reduced IL-22-producing 
ILC3, increased C. rodentium infection and translocation, reduced 
epithelial reactivity genes, increased weight loss and reduced survival 
(Fig. 2k-n and Extended Data Fig. 5). Altogether, these data indicate that 
ILC3-intrinsic neurotrophic factor cues regulate gut defence and 
homeostasis. 

We used a multi-tissue organoid system to show that IL-22 is the 
molecular link between RET-dependent ILC3 activation and epithe- 
lial reactivity. Addition of GFL to ILC3-epithelial organoids strongly 
induced epithelial reactivity genes in an IL-22- and RET-dependent 
manner (Fig. 3a, b and Extended Data Fig. 6a). To further examine 
how RET signals control innate IL-22, we investigated a gene signature 
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Figure 3 | ILC3-autonomous RET signals directly control 1122 
downstream of pSTAT3. a, b, Epithelial-ILC3 organoids (n= 9). c, Gene 
expression in Ret ILC3 compared to their wild-type controls (n= 4). 

d, Gene expression in ILC3 after activation by GFL (n = 4). e, Enteric 

Ret ILC3. pERK, n= 8; pAKT, n= 12; phosphorylated p38/MAP kinase, 
n= 6; pSTAT3, n= 14. MFI, mean fluorescence intensity. f, Enteric ILC3 
activation by GFL. pERK, n= 10; pAKT, n= 16; phosphorylated p38/ 
MAP kinase, n = 3; pSTAT3, n= 15. g, pSTAT3 expression in enteric ILC3 
cultured with medium (n= 7), GFL (n= 11) or GFL and inhibitors for: 
p38 MAPK/ERK-AKT (LY) (n=7); ERK (PD) (n=7); AKT (VIII) (n=8); 
and p38 MAPK (SB) (n=6). h, 1/22 in enteric ILC3 cultured with GFL 
(n= 17) or GFL and the inhibitors LY (n= 18); PD (n= 16); VIII (n= 15); 
SB (n= 15); and the STAT3 inhibitor (S31) (n = 8). i, 1122 locus. The letters 
a-i in this panel indicate putative STAT3 binding sites and primer location 
(see Methods). j, ChIP analysis of ILC3 stimulated with GFL (n= 10). Data 
are representative of 3 independent experiments. Error bars show s.e.m. 
*P<0.05, **P<0.01. 


21 JULY 2016 | VOL 535 | NATURE | 441 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


associated with ILC identity!. Whereas the master ILC transcription 
factors Runx1, Id2, Gata3, Rora, Rorgt, Ahr and Stat3 were unperturbed, 
1122 was significantly reduced in Ret* ILC3 (Fig. 3c and Extended Data 
Fig. 6b). Accordingly, activation of ILC3 with all or distinct GFL- 
GFRa pairs in trans efficiently increased [122 despite normal expres- 
sion of other ILC3-related genes (Fig. 3d and Extended Data Fig. 6c). 
Activation of RET by GFL leads to p38 MAPK/ERK-AKT cascade 
activation in neurons, whereas phosphorylation of STAT3 shapes 1/22 
expression”!”, Analysis of Ret ILC3 revealed hypo-phosphorylated 
ERK1/2, AKT, p38/MAP kinase and STATS (Fig. 3e and Extended 
Data Fig. 6d). Accordingly, GFL-induced RET activation in ILC3 led 
to rapid ERK1/2, AKT, p38/MAP kinase and STAT3 phosphoryla- 
tion and increased [122 transcription (Fig. 3d, fand Extended Data 
Fig. 6e, f). Accordingly, inhibition of ERK, AKT or p38/MAP kinase upon 
GFL activation led to impaired STATS activation and [122 expression 
(Fig. 3g, h). Finally, inhibition of STAT3 upon GFL-induced RET 
activation led to decreased [/22 (Fig. 3h). To examine whether GFL 
directly regulate 1/22, we performed chromatin immunoprecipita- 
tion (ChIP)!8 (Fig. 3i, j). Stimulation of ILC3 with GFL resulted in 
increased binding of pSTAT3 in the 1/22 promoter and increased 
trimethyl-H3K36 at the 3' end of 1122, indicating active [/22-transcribed 
regions’? (Fig. 3d, j). Thus, cell-autonomous RET signals control ILC3 
function and gut defence by direct regulation of 1/22 downstream of 
STAT3 activation. 

Propensity towards inflammation and dysregulation of intestinal 
homeostasis have been associated with dysbiosis”°?!. When compared 
to their wild-type littermates, Ret mice have altered microbial 
communities as evidenced by quantitative analysis, weighted UniFrac 
analysis and significantly altered levels of Sutterella, unclassified 
Clostridiales and Bacteroides (Fig. 4a and Extended Data Fig. 7). 
Discrete microbial communities may have transmissible colitogenic 
potential?°!, Nevertheless, germ-free mice colonized with the microbi- 
ota of Ret or their control littermates revealed similar susceptibility to 
DSS-induced colitis and identical innate IL-22 production (Fig. 4b-d). 
In agreement, co-housed Ret* and wild-type littermates had differ- 
ent propensity towards intestinal inflammation (Fig. 2c, d). Together, 
these data indicate that dysbiosis per se is insufficient to cause altered 
innate IL-22 and susceptibility to gut inflammation as observed in Ret* 
mice (Fig. 2c-f). Thus, we hypothesised that GFL-producing cells inte- 
grate commensal and environmental signals to control innate IL-22. 
Accordingly, antibiotic treatment of Ret“ and their wild-type littermate 
controls resulted in similar ILC3-derived IL-22 (ref. 22) (Fig. 4e). 

Neurotrophic factors of the GDNF family were shown to be 
produced by enteric glial cells, which are neuron-satellites expressing 
the glial fibrillary acidic protein (GFAP)””?. Notably, double reporter 
mice for ILC3 (Ret@f®) and glial cells (Gfap-CreRosa26*¥? ) revealed that 
stellate-shaped projections of glial cells are adjacent (4.35 + 1.42 1m) 
to RORyt* ILC3 within cryptopatches (Fig. 4f and Extended Data 
Fig. 8a). These data suggest the existence of paracrine glial-ILC3 cross- 
talk orchestrated by neurotrophic factors. Accordingly, lamina propria 
glial cells were the main producers of GFL (Extended Data Fig. 8b). 
Recent studies have shown that glial cells express pattern recognition 
receptors, notably Toll-like receptors (TLRs)**”°. Activation of 
neurosphere-derived glial cells revealed they specifically respond 
to TLR2, TLR4, and the alarmins IL-1 and IL-33, which efficiently 
controlled GFL expression and induced robust innate 1/22 ina MYD88- 
dependent manner (Fig. 4g-i and Extended Data Fig. 8c-g). To formally 
demonstrate the physiological importance of MYD88-dependent glial 
cell sensing on innate IL-22 production, we deleted Myd88 in GFAP- 
expressing glial cells by breeding Gfap-Cre with Myd83"" mice”®’. 
Remarkably, glial-intrinsic deletion of Myd88 resulted in decreased 
intestinal GFL, increased gut inflammation, impaired ILC3-derived 
IL-22, and increased weight loss (Fig. 4j-m; Extended Data Fig. 9a-d). 
In agreement, Gfap-CreMyd88* mice had increased susceptible to 
C. rodentium infection (Extended Data Fig. 9e-h). Thus, mucosal 
glial cells orchestrate innate IL-22 via neurotrophic factors, 
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Figure 4 | Glial cells set GFL expression and innate IL-22 via MYD88- 
dependent sensing of the microenvironment. a, Weighted Unifrac PCoA 
analysis and genus-level comparisons from co-housed Ref!" (white circles) 
and Ret® (black circles) littermates (n =5). Purple, unclassified $24-7; 
red, Bacteroides; green, Sutterella; blue, unclassified Clostridiales; grey, 
other. b-d, DSS treatment of germ-free (GF) mice (n =5) colonized with 
microbiota from Ref! or Ret’ mice. b, Histopathology. c, Inflammation 
score. d, Innate IL-22. e, Innate IL-22 after antibiotic treatment (n= 8). 

f, RetS?” Gfap-CreRosa26*!? mice. Green, RET/GFP; red, GFAP/REP. 

g, h, Glial cell activation with TLR2, TLR4, IL-18 receptor and IL-33 
receptor ligands (n = 6). i, TLR ligands, IL-18 and IL-33 activation of 
co-cultured ILC3 with wild-type (white bars) or Myd88~/~ glial cells 
(black bars) (n = 6). j-m, DSS treatment of Gfap-CreMyd88* mice 

(n= 12). j, Histopathology. k, Inflammation score and colon length. 

1, Innate IL-22. m, Body weight. Scale bars, 200 1m (b, j); 10j1m (f). Data 
are representative of 3-4 independent experiments. Error bars show s.e.m. 
*P<0.05, **P<0.01. 


downstream of MYD88-dependent sensing of commensal products and 
alarmins. 

Defining the mechanisms by which ILC3 integrate environmental 
cues is critical to understanding mucosal homeostasis. Our work sheds 
light on the relationships between ILC3 and their microenvironment, 
notably through decoding a novel glial-ILC3-epithelial cell unit 
orchestrated by neurotrophic factors (Extended Data Fig. 10). 
Glial-derived neurotrophic factors operate in an ILC3-intrinsic manner 
by activating the tyrosine kinase RET, which directly regulates innate 
IL-22 downstream of p38 MAPK/ERK-AKT and STAT3 phosphoryl- 
ation (Extended Data Fig. 10). Future studies will further elucidate the 
mechanisms inducing RET expression in ILC3. 

Our data demonstrate that, in addition to their well-established 
capacity to integrate dendritic-cell-derived cytokines’, ILC3 perceive 
distinct multi-tissue regulatory signals leading to STAT3 activity 
and IL-22 expression, notably by integration of glial-cell-derived 
neuroregulators. Thus, rather than providing hard-wired signals for 
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ILC3-immunity, we propose that RET signalling induces fine-tuned 
innate IL-22 production that leads to efficient gut homeostasis and 
defence. 

Previous studies demonstrated that neurons may indirectly shape 
fetal lymphoid tissue inducer cell aggregation via regulation of mes- 
enchymal cells and that ablation of glial cells leads to gut inflamma- 
tion?®°; here we reveal that glial cells are central hubs of neuronal 
and innate immune regulation. Notably, neurotrophic factors are the 
molecular link between glial cell sensing, innate IL-22 and intestinal 
epithelial defence. Thus, it is tempting to speculate that glial-immune 
cell units might be also critical to the homeostasis of other barriers, 
notably in the skin, lung and brain*”. From an evolutionary perspective, 
coordination of innate immunity and neuronal function may ensure 
efficient mucosal homeostasis and a co-regulated neuro-immune 
response to various environmental challenges, including xenobiotics, 
intestinal infection, dietary aggressions and cancer. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data reporting. The experiments were not randomized. The investigators were not 
blinded to allocation during experiments and outcome assessment, unless stated 
otherwise. Power analysis was used to estimate sample size. 

Mice. C57BL/6J mice were purchased from Charles River. Ret? (ref. 13), 
Ragl~'~yc~'~ (refs 31, 32), RetM®N?5 (ref. 14), Rosa26‘"? (ref. 33), Rosa26®"” 
(ref. 34), Ret" (ref. 16), Rorgt-Cre!°, Il1b~'~ (ref. 35) and Myd88~'~ (ref. 36) were 
in a full C57BL/6) background. Gfap-Cre*® bred to Mydssi" (ref. 27) were in 
F8-F9 to a C57B1/6) background. All lines were bred and maintained at IMM 
Lisboa animal facility. Mice were systematically compared with co-housed litter- 
mate controls. Both males and females were used in this study. Randomization 
and blinding were not used unless stated otherwise. All animal experiments were 
approved by national and institutional ethical committees, respectively, Diregao 
Geral de Veterinaria and iMM Lisboa ethical committee. Germ-free mice were 
housed at Instituto Gulbenkian de Ciéncia, Portugal, and Institut Pasteur, France, 
in accordance to institutional guidelines for animal care. Power analysis was 
performed to estimate the number of experimental mice. 

Generation of fetal liver chimeras. For reconstitution experiments, 5 x 10° fetal 
liver cells were isolated from E14.5 Ret" 1/C¥? or RetS#?/GF? mice and injected intra- 
venously into non-lethally irradiated (200 rad) alymphoid Rag1~'~yc-~ hosts. 
Mice were analysed 8 weeks after transplantation. 

DSS-induced colitis. Dextran sodium sulfate (DSS) (molecular mass 36,000- 
50,000 Da; MP Biomedicals) was added into drinking water 3% (w/v) for 5 days 
followed by 2 days of regular water. Mice were analysed at day 7. Body weight, 
presence of blood and stool consistency was assessed daily. 

Citrobacter rodentium infection. Infection with Citrobacter rodentium ICC180 
(derived from DBS100 strain)*” was performed by gavage inoculation of 10° 
colony-forming units*”**, Acquisition and quantification of luciferase signal was 
performed in an IVIS system (Caliper Life Sciences). Throughout infection, weight 
loss, diarrhoea and bloody stools were monitored daily. 

Antibiotic treatment. Pregnant females or newborn mice were treated with strep- 
tomycin 5 gl“', ampicillin 1 gl“! and colistin 1 g1~' (Sigma-Aldrich) into drinking 
water with 3% sucrose. Control mice were given 3% sucrose in drinking water as 
previously described”. 

Microscopy. Intestines from Ret“? and Ret©"” chimaeras were imaged in a Zeiss 
Lumar V12 fluorescence stereo microscope with a NeoLumar S 0.8 x objective 
using the GFP filter. Whole-mount analysis was performed as previously 
described”. Briefly, adult intestines were flushed with cold PBS (Gibco) and 
opened longitudinally. Mucus and epithelium was removed and intestines were 
fixed in 4% PFA (Sigma-Aldrich) at room temperature for 10 mins and incubated in 
blocking/permeabilizing buffer solution (PBS containing 2% BSA, 2% goat serum, 
0.6% Triton X-100). To visualise three-dimensional structures of the small intestine, 
samples were cleared with benzyl alcohol-benzyl benzoate (Sigma-Aldrich) before 
dehydration in methanol”. For analysis of thick gut sections, intestines were 
fixed with 4% PFA at 4°C overnight and were then included in 4% low-melting 
temperature agarose (Invitrogen). Sections of 100|1m were obtained with a Leica 
VT1200/VT1200 S vibratome and embedded in Mowiol (Calbiochem)”. Slides or 
whole-mount samples were incubated overnight or for 1-2 days respectively at 4°C 
using the following antibodies: rat monoclonal anti-B220 (RA3-6B2) (eBioscience), 
mouse monoclonal anti-ROR}t (Q31-378) (BD Pharmigen), mouse monoclonal 
anti-GFAP (GA-5) (Sigma-Aldrich), mouse monoclonal anti-GFAP Cy3 (GA-5) 
(Abcam), anti-GDNF antibody (Abcam), DAPI (4',6-Diamidino-2-Phenylindole, 
Dihydrochloride) (Invitrogen). A647 goat anti-rat, A568 goat anti-rat, A647 goat 
anti-mouse, A488 rabbit anti-GFP, and A488 goat anti-rabbit secondary antibodies 
were purchased from Invitrogen. Neurospheres and cultured glial cells were fixed in 
PFA 4% for 10 minutes at room temperature and permeabilized in PBS-Triton 0.1% 
within 30 seconds. After several washing steps with PBS, cells were incubated with 
antibodies for 3h at room temperature and then mounted in Mowiol*®. Samples 
were acquired on a Zeiss LSM710 confocal microscope using EC Plan-Neofluar 
10 /0.30 M27, Plan Apochromat 20/0.8 M27 and EC Plan-Neofluar 40 x/1.30 
objectives. Three-dimensional reconstruction of images was achieved using 
Imaris software and snapshot pictures were obtained from the three-dimensional 
images. For analysis of confocal images, cells were counted using in-house software, 
written in MATLAB (Mathworks, Natick, MA). Briefly, single-cell ILC3 nuclei 
were identified by RORt by thresholding and particle analysis. Regions of interest 
(ROIs) (Extended Data Fig.1i; bottom panels) were defined from each nucleus for 
analysis in the GFP channel, where staining was considered positive if a minimum 
number of pixels (usually 20) were above a given threshold. The software allows 
for batch processing of multiple images and generates individual report images 
for user verification of cell-counting results and co-expression analysis (https:// 
imm.medicina.ulisboa.pt/en/servicos-e-recursos/technical-facilities/bioimaging). 
Histopathology analysis. Colon samples were fixed in 10% neutral buffered for- 
malin. The colon was prepared in multiple cross-sections or ‘swiss roll’ technique”, 


routine-processed for paraffin embedding and 3—41m sections were stained with 
haematoxylin and eosin. Enteric lesions were scored by a pathologist blinded 
to experimental groups, according to previously published criteria*'~*’. Briefly, 
lesions were individually scored (0-4 increasing severity) for the following criteria: 
1, mucosal loss; 2, mucosal epithelial hyperplasia; 3, degree of inflammation; 
4, extent of the section affected in any manner; and 5, extent of the section affected 
in the most severe manner as previously described’. Final scores were derived by 
summing the individual lesion and the extent scores. The internal diameter of the 
crypts was measured in at least five fields (10x magnification), corresponding 
to the hotspots in which the most severe changes in crypt architecture were 
seen. Measurements were performed in an average of 35 crypts per mouse, from 
proximal to distal colon. Intestinal villus height was measured in the jejunum. 
Measurements were performed in slides scanned using a Hamamatsu Nanozoomer 
SQ digital slide scanner running NDP Scan software. 

Enteric glial cell isolation. Enteric glial cells isolation was adapted from previously 
described protocols****. Briefly, the muscularis layer was separated from the 
submucosa with surgical forceps under a dissection microscope (SteREO Lumar. 
V12, Zeiss). The lamina propria was scraped mechanically from the underlying 
submucosa using 1.5-mm cover-slips (Thermo Scientific). Isolated tissues were 
collected and digested with Liberase TM (7,5 1g ml~!; Roche) and DNase I 
(0.1 mgml'; Roche) in RPMI supplemented with 1% HEPES, sodium pyruvate, 
glutamine, streptomycin and penicillin and 0.1% 6$-mercaptoethanol (Gibco) 
for approximately 40 min at 37°C. Single-cell suspensions were passed through a 
100-{1m cell strainer (BD Biosciences) to eliminate clumps and debris. 

Flow cytometry and cell sorting. Lamina propria cells were isolated as previously 
described“. Briefly, intestines were digested with collagenase D (0.5mg ml |; 
Roche) and DNase I (0.1 mg ml}; Roche) in RPMI supplemented with 10% FBS, 
1% HEPES, sodium pyruvate, glutamine, streptomycin and penicillin and 0.1% 
8-mercaptoethanol (Gibco) for approximately 30 min at 37°C under gentle 
agitation. For cytokine analysis, cell suspensions were incubated 4h in PMA/ 
ionomycin (Sigma-Aldrich) and brefeldin A (eBioscience) at 37°C. Intracellular 
staining was performed using IC fixation/permeabilization kit (eBioscience). Cells 
were stained using PBS, 1% FBS, 1% HEPES and 0.6% EDTA (Gibco). Flow cytom- 
etry analysis and cell sorting was performed using FORTESSA and FACSAria flow 
cytometers (BD Biosciences). Data analysis was performed using FlowJo software 
(Tristar). Sorted populations were >95% pure. Cell suspensions were stained 
with anti-CD45 (30-F11), anti-TER119 (TER-119), TCR8 (H57-597), anti-CD3< 
(eBio500A2), anti-CD19 (eBio1D3), anti-NK1.1 (PK136), anti-CD11c (N418), 
anti-Grl (RB6-8C5), anti-CD11b (Mi/70), anti-CCR6 (29-2L17), anti-CD127 
(IL-7Ra; A7R34), anti-Thy1.2 (53-2.1), anti-CD49b (DX5), anti-TCR65 (GL3), 
anti-NKp46 (29A1.4), anti-IL-17 (eBiol17B7), anti-IL-22 (1H8PWSR), rat IgG1 
isotype control (eBRG1) antibodies, 7AAD viability dye, anti-mouse CD16/ 
CD32 (Fc block), anti-RORyt (AFKJS-9); rat IgG2a,, isotype control (eBR2a) and 
streptavidin fluorochrome conjugates all from eBioscience; anti-CD4 (GK1.5), 
anti-CD31 (390), anti-CD8qa (53-6.7), anti-CD24 (M1/69), anti-Epcam (G8.8) 
antibodies were purchased from Biolegend. Anti-RET (IC718A) antibody was 
purchased from R&D Systems. LIVE/DEAD Fixable Aqua Dead Cell Stain 
Kit was purchased from Invitrogen. Cell populations were defined as: ILC3 - 
CD45*Lin-Thy1.2"IL7Ra*+ROR tt. For ILC3 subsets, additional markers were 
employed: LTi - CCR6*Nkp46~; ILC3 NCR~ - CCR6~Nkp46~; ILC3 NCR* - 
CCR6 Nkp46~. Lineage was composed from CD3e, CD8a, TCR3, TCR 6, 
CD19, Grl, CD11c and TERI19. Glial cells - CD45~ CD31~ TER119" CD49b* 
(ref. 47); T cells - CD45*CD3et; 46 T cells - CD45*CD3e*y5TCR’; B cells - 
CD45*+CD19*B220*; macrophages - CD45*CD11b*F4/80*; dendritic cells - 
CD45*CD19- CD3e~ MHCII* CD 1 1c"; enteric neurons - CD45” RET/GEP* 
(ref. 13); epithelial cells - CD45~CD24*Epcam’™. 

Quantitative RT-PCR. Total RNA was extracted using RNeasy micro kit 
(Qiagen) or Trizol (Invitrogen) according to the manufacturer's protocol. RNA 
concentration was determined using Nanodrop Spectrophotometer (Nanodrop 
Technologies). Quantitative real-time reverse transcription (RT)-PCR was 
performed as previously described”*. Hprt and Gapdh were used as housekeep- 
ing genes. For TaqMan assays (Applied Biosystems), RNA was retro-transcribed 
using a High Capacity RNA-to-cDNA Kit (Applied Biosystems), followed by a 
pre-amplification PCR using TaqMan PreAmp Master Mix (Applied Biosystems). 
TaqMan Gene Expression Master Mix (Applied Biosystems) was used in real-time 
PCR. TaqMan Gene Expression Assays (Applied Biosystems) were the following: 
Gapdh Mm99999915_g1; Hprt Mm00446968_m1; Artn Mm00507845_m1; 
Nrtn Mm03024002_m1; Gdnf Mm00599849_m1; Gfral Mm00439086_m1; 
Gfra2 Mm00433584_m1; Gfra3 Mm00494589_m1; Ret Mm00436304_m1; 
1122 Mm01226722_g1; I117a Mm00439618_m1; 1123r Mm00519943_m1; 
Rorgt Mm01261022_m1; I17ra Mm00434295_m1; Ahr Mm00478932_m1; 
Stat3 Mm01219775_m1; Cxcr6 Mm02620517_s1; Nfkbiz Mm_00600522_m1; 
Reg3a Mm01181787_m1; Reg3b Mm00440616_g1; Reg3g Mm00441127_m1; 
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Defal Mm02524428_g1; Defa29 Mm00655850_m1; Defa5 Mm00651548_¢1; 
Defa21 Mm04206099_gH; Mucl Mm00449599_m1; Muc3 Mm01207064_m1; 
Muc13 Mm00495397_m1; Gfap Mm01253033_m1; Ascl2 Mm01268891_g; 
Tff3 Mm00495590_m1; Relm-b Mm00445845_m1; Pla2g2a Mm00448160_m1; 
Pla2g5 Mm00448162_m1; Wnt3 Mm00437336_m1; Ctnnb1 Mm00483039_ m1; 
Axin2 Mm00443610_m1; Dil1b Mm01279269_m1; [118 Mm00434225_m1; 
Tnfa Mm00443260_g1; Lyzl1 Mm00657323_m1; Lrg5 Mm00438890_m1; Tbx21 
Mm00450960_m1; Jd2 Mm00711781_m1; Runx1 Mm01213404_m1; Notch1 
Mm00435249_m1; Notch2 Mm00803077_m1; Gata3 Mm00484683_m1; 
Bcl2 Mm00477631_m1; Bcl2l11 Mm00437783_m1; Arntl Mm00500226_m1; 
Glpr2 Mm01329475_m1; Gjal Mm01179639_s1; Ednrb Mm00432989; S100b 
Mm00485897_m1; Sox10 Mm00569909_m1. Real-time PCR analysis was 
performed using ABI Prism 7900HT Sequence Detection System or StepOne Real- 
Time PCR system (Applied Biosystems). 

ILC3 activation and cell signalling. Sorted intestinal ILC3 cells were starved 
for 3hours in RPMI at 37°C in order to ensure ILC3 viability. Ref” or Ret* 
were analysed directly ex vivo. To test ERK, AKT, p38-MAPK (Cell Signaling 
Technology) and STAT3 (BD Pharmigen) upon GFL stimulation wild-type ILC3 
were activated with 500ngml! (each GFL) and co-receptors (recombinant rat (rr) 
GFR-al, recombinant mouse (rm) GFR-«2, rrGFR-a3 and rrGNDF from R&D 
Systems; recombinant human (rh) NRTN and rhARTN from PeproTech) for 10 
and 30min. When referring to the use of “GFL, we have employed GDNE, NRTN, 
ARTN and their specific co-receptors in combination. For inhibition experiments 
cells were incubated 1h at 37°C before GFL stimulation, to test ERK, AKT, p38/ 
MAPK and STAT3 phosphorylation, or during overnight stimulation with GFLs, 
to determine J/22 expression levels. Inhibitors were purchased from Sigma-Aldrich; 
p38 MAPK/ERK-AKT, LY294002 (LY); ERK, PD98059 (PD); AKT, AKT Inhibitor 
VIII (VIII); p38 MAPK, SB 202190 (SB); and pSTAT3, S31-201 (S3]). 
Chromatin immunoprecipitation (ChIP) assay. Enteric ILC3 from adult 
C57BL/6J mice were isolated by flow cytometry. Cells were starved for 3h 
with RPMI supplemented with 1% HEPES, sodium pyruvate, glutamine, 
streptomycin and penicillin and 0.1% (-mercaptoethanol (Gibco) at 37°C. 
Cells were stimulated with GFLs (500 ng ml“! each)*, lysed, cross-linked and 
chromosomal DNA-protein complex sonicated to generate DNA fragments 
ranging from 100-300 base pairs. DNA-protein complexes were immu- 
noprecipitated, using LowCell ChIP kit (Diagenode)'*, with 31g of rabbit 
polyclonal antibody against anti-pSTAT3 (Cell Signalling Technology), rabbit 
control IgG (Abcam) or H3K36me3 (07-030; Millipore). Immunoprecipitates 
were uncross-linked and analysed by quantitative PCR using primer pairs 
(5°-3’) flanking putative sites on [/22. Vehicle (BSA)-stimulated ILC3s were 
used as controls. [/22 primer sequences were previously described** °°, 
briefly: (a) forward: 5’-TGCAATCAATCCCAGTATTTTG-3’ and reverse: 
5’-CTTGTGCAAGCATAAGTCTCAA-3’; (b) forward: 5’-GAAGTTGGTGG 
GAAAATGAGTCCGTGA-3’ and reverse: 5'/-GCCATGGCTTTGCCGTAG 
TAGATTCTG-3’; (c) forward: 5’-ACGGGAGATCAAAGGCTGCTCT-3’ and 
reverse: 5‘-GCCAACAAGGTGCTTTTGC-3’; (d) forward: 5’-CTCACC 
GTGACGTTTTAGGG-3’ and reverse: 5/-GTGAATGATATGACATCAGAC-3; 
(e) forward 5‘-CGACGAACATGCTCCCCTGATGTTTTT-3’ and reverse: 
5’-AAACTCATAGATTTCTGCAGGACAGCC-3’; (f) forward: 5’-AGCTG 
CATCTCTTTCTCTCCA-3’ and reverse: 5’-TATCCTGAAGGCCAA 
AATAGGA-3’; (g) forward: 5’-ACGACCAGAACATCCAGAAGA-3! and 
reverse: 5’-GCAGAGAAAGAAATCCCCGC-3’; (h) forward: 5’-AGGGGGAC 
TTIGCTTTGCCATTT-3’ and reverse: 5’-AACACCCCTTCTTTCC 
TCCTCCAT-3’; (i) forward: 5’/-CTGCTCCTTCCTGCCTTCTA-3’ and reverse: 
5’-CTGAGCCAGGTTTCATGTGA-3’. Primer positions are shown in Fig. 3i 
relative to the transcription start codon of 1/22. 

Colony-forming units and paracellular permeability. Organs were collected, 
weighed, and brought into suspension. Bacterial colony-forming units were 
determined per gram of tissue and total organ. Colony-forming units were 
determined by serial dilutions on Luria Broth (LB) agar and MacConkey agar 
(Sigma-Aldrich). Colonies were counted after 2 days of culture at 37°C. To address 
intestinal paracellular permeability 16mg per mouse of Dextran-Fitc (Sigma 
Aldrich) were administrated by gavage after overnight starvation. Plasma was 
analysed after 4 hours of Dextran-Fitc administration using a Microplate Reader 
TECAN Infinity F500. 

BrdU administration and Ki-67 labeling. BrdU was administrated by intraperi- 
toneal injection (1.25 mg per mouse). For flow cytometric analysis of epithelial 
cell proliferation anti-BrdU (Staining Kit for flow Cytometry, eBioscience) and 
anti-mouse Ki-67 antibody (BioLegend) were used. 

Quantitative PCR analysis of bacteria in stool at the phylum level. DNA 
from faecal pellet samples was isolated with ZR Fecal DNA MicroPrep (Zymo 
Research). Quantification of bacteria was determined from standard curves 


LETTER 


established by qPCR. qPCR were performed with Power SYBR Green PCR 
Master Mix (Applied Biosystems) and different primer sets using a StepOne 
Plus (Applied Biosystems) thermocycler. Samples were normalized to 16S 
rDNA and reported according to the 244“ method. Primer sequences were: 
16S rDNA, forward: 5’-ACTCCTACGGGAGGCAGCAGT-3’ and reverse: 
5'-ATTACCGCGGCTGCTGGC-3’; Firmicutes, forward: 5’‘-ACTCCT 
ACGGGAGGCAGC-3’ and reverse: 5/-GCTTCT TAGTCAGGTACCGTCAT-3’; 
Bacteroidetes, forward: 5'/-GGTTCTGAGAGGAGGTCCC-3’ and reverse: 
5'-GCTGGCTCCCGTAGGAGT-3’; Proteobacteria, forward: 5’-GGTTC 
TGAGAGGAGGTCCC-3’ and reverse 5’/-GCTGGCTCCCGTAGGAGT-3’. 

16S rRNA quantification and gene sequencing. Faeces were isolated from 
co-housed Re¢! or Ret“ littermates. Sequencing of the 16S rRNA gene was 
performed as previously described*!. Briefly, barcoded primers were used to 
amplify the V4 region of the 16S rRNA gene, and the amplicons were sequenced 
on a MiSeq instrument (Illumina, San Diego, USA) using 150 bp, paired-end 
chemistry at the University of Pennsylvania Next Generation Sequencing Core. 
The paired ends were assembled and quality filtered, selecting for reads with a 
quality score >30. Reads with >10bp homopolymers and reads shorter than 
248 bp or longer than 255 bp were removed from the analysis. 16S rRNA sequence 
data were processed using mothur version 1.25.0 (ref. 52) and QUIME version 1.8 
(ref. 53). Chimaeric sequences were removed with ChimeraSlayer™. Operational 
taxonomic units (OTUs) were defined with CD-HIT* using 97% sequence 
similarity as a cut-off. Only OTUs containing >2 sequences were retained; 
OTUs assigned to cyanobacteria or those which were not assigned to any phylum 
were removed from the analysis. Taxonomy was assigned using the Ribosomal 
Database Project (RDP) classifier v 2.2 (ref. 56), multiple sequence assignment 
was performed with PyNAST (v 1.2.2)°”, and FastTree”® was used to build the 
phylogeny. Samples were rarified to 22,000 sequences per sample for alpha- and 
beta-diversity analyses. Taxonomic relative abundances are reported as the median 
with standard deviation. P values were calculated using the Wilcoxon rank-sum 
test. Statistical tests were conducted in R version 3.2.0. To determine which factors 
were associated with microbial community composition, statistical tests were 
performed using the non-parametric analysis of similarities (ANOSIM) with 
weighted UniFrac distance metrics”. 

Data accession. The sequencing data generated in this study have been submitted 
to the NCBI Sequence Read Archive under BioProject PRJNA314493 (SRA: 
http://www.ncbi.nlm.nih.gow/sra/?term=PRJNA314493). 

Intestinal organoids. IntestiCult Organoid Growth Medium and Gentle Cell 
Dissociation Reagent were purchased from StemCell. Intestinal crypts were 
isolated from C57BL/6] mice according to the manufacturer’s instructions and 
were added to previously thawed, ice-cold Matrigel at a 1:1 ratio and at a final 
concentration of 5,000-7,000 crypts per ml. 15 11 of this mix was plated per well 
of a 96-well round-bottom plate. After Matrigel solidification 100 1l of growth 
medium (100U ml! penicillin/streptomycin) was added and replaced every 
3 days. Organoids were grown at 37°C with 5% CO; and passaged according to the 
manufacturer's instructions. Freshly sorted intestinal ILC3 were added to 5-8 days 
old epithelial organoids after plating for 24 hours with or without anti-mouse IL-22 
antibody (R&D Systems). 

IL-22 agonist administration in vivo. 150 1g of anti-IL-22 antibody (8E11; gift 
from Genentech) or mouse IgG1 isotype control (MOPC-21; Bio X Cell) was 
administered by intraperitoneal injection to Ret"’”® mice every 2 days. Animals 
were analysed 2 weeks after the first administration. 

Neurosphere-derived glial cells. Neurosphere-derived glial cells were obtained as 
previously described. Briefly, total intestines from E14.5 C57BL/6J and Myd88~/~ 
mice were digested with collagenase D (0.5mg ml~'; Roche) and DNase I 
(0.1 mg ml-!; Roche) in DMEM/EF-12, GlutaMAX, supplemented with 1% HEPES, 
streptomycin/penicillin and 0.1% (-mercaptoethanol (Gibco) for approximately 
30 min at 37°C under gentle agitation. Cells were cultured for 1 week in a CO, 
incubator at 37°C in DMEM/F-12, GlutaMAX, streptomycin and penicillin and 
0.1% 8-mercaptoethanol (Gibco) supplemented with B27 (Gibco), EGF (Gibco) 
and FGF2 (Gibco) 20ng ml". After 1 week of culture cells were treated with 0.05% 
trypsin (Gibco), transferred into PDL (Sigma-Aldrich) coated plates and culture 
in DMEM supplemented with 10% FBS, 1% HEPES, glutamine, streptomycin and 
penicillin and 0.1% 8-mercaptoethanol (Gibco) until confluence. Glial cells were 
activated with TLR2 (51g ml~!) (Pam3CSK4), TLR3 (100 ug ml!) (PolyI:C), TLR4 
(50g ml!) (LPS), TLR9 (501g ml!) (DSDNA-EC) ligands from Invivogen and 
IL-18 (10pgml-!) (401ML005), IL-18 (50j1gml~') (B002-5), IL-33 (0.1 j4gml-) 
(3626ML) recombinant proteins from R&D Systems. Cells were also co-cultured 
with purified ILC3 from wild-type and I]1b-deficient mice. IL-22 expression in 
glial-ILC3 co-cultures upon TLR4 activation was also performed using GDNF 
(2jgml-!) (AB-212-NA), NRTN (21g ml!) (AF-387sp) and ARTN (0.3 ,,gml-') 
(AF-1085-sp) blocking antibodies. Cells were analysed after 24 hours of co-culture. 
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Statistics. Results are shown as mean + s.e.m. Statistical analysis used Microsoft 
Excel. Variance was analysed using F-test. Student's t-test was performed on 
homoscedastic populations, and Student’s t-test with Welch correction was applied 
on samples with different variances. Analysis of survival curves was performed 
using a MAntel—Cox test. Results were considered significant at *P < 0.05 and 
**P < 0.01. Statistical treatment of metagenomics analysis is described in the 
Methods section ‘16S rRNA gene sequencing and analysis. 
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Extended Data Figure 1 | ILC3 selectively express the neurotrophic expression in ILC3. Thin line, Ab-treated; bold line, specific pathogen free 
factor receptor RET. a, Expression of RET protein in gut (SPF). g, Ret expression in enteric ILC3 from germ-free (GF) mice and 
CD45*Lin- Thy1.2"IL7RatROR t* ILC3. b, Analysis of gut ILC3 from SPF controls (n = 4). h, Analysis of lamina propria populations from 
Ret@F? mice. Embryonic day 14.5 (E14.5). c, d, Analysis of enteric ILC3 RetS¥? mice. i, Enteric ILC3 clusters. Green, RET/GFP; blue, ROR‘t; red, 
subsets from Ret°*? mice. e, Analysis of cytokine-producing ILC3 from B220. Bottom, quantification analysis for RET/GFP and RORyt 

Ret? mice. f, Pregnant Ret©? mice were provided with antibiotic co-expression (79.97 + 4.72%). j, Rare RET-expressing ILC3 in intestinal 
cocktails that were maintained after birth until analysis at 6 weeks of villi. Green, RET/GFP; blue, RORt; red, CD3<. Scale bars, 10j1m. Data are 
age. Left, RET/GFP (white); right, flow cytometry analysis of RET/GFP representative of 4 independent experiments. Error bars show s.e.m. 
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a, Rorgt-Cre mice were bread to Rosa26*"? mice. Analysis of Rosa26/YFP plasma (n=5). h, Tissue repair genes in Ref* intestinal epithelium in 
expression in gut ILC3 from Rorgt-CreRosa26*"? mice. b, Number of comparison to their wild-type littermate controls (n = 8). i, Reactivity 
Peyer’s patches (PP) (n= 10). ¢, T-cell-derived IL-22 in Ret* mice and genes in RetM=N8 mice treated with anti-IL-22 blocking antibodies 
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in Ret* mice and their wild-type littermate controls (n = 4). e, Intestinal anti-IL-22, n= 4. Data are representative of 3 independent experiments. 


villus and crypt morphology (n= 6). f, Epithelial cell proliferation (n =5). Error bars show s.e.m. 
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Ragi~'~ Ret" littermate controls at day 6 after infection (n = 15). 

b, MacConkey plates of liver cell suspensions from Rag1~/~ Ret* and their 
Ragi~'~ Ret" littermate controls at day 6 after C. rodentium infection. 

c, Whole-body imaging of Rag1~/~ Ret* and their Rag1~'~ Ret" littermate 
controls at day 6 after luciferase-expressing C. rodentium infection. 

d, Epithelial reactivity gene expression in C. rodentium infected 
Rag1~'~Ret* mice (n= 17) and their Rag1~/~ Ret" littermate controls 
(n= 15). e, Weight loss in C. rodentium-infected Rag1~/~ Ret“ mice and 
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C. rodentium infected Rag1~/~ Ret* mice and their Rag1~'~ Ret" littermate 
controls (n= 8). g, C. rodentium translocation to the liver of Ret and their 
Ret!" littermate controls at day 6 after infection (n =6). h, MacConkey 
plates of liver cell suspensions from Ret* and their Re" littermate controls 
at day 6 after C. rodentium infection. i, Whole-body imaging of Ret* 

and their Ret! littermate controls at day 6 after luciferase-expressing 

C. rodentium infection. j, C. rodentium infection burden (n= 8). k, Innate 
IL-22 in in C. rodentium infected Ret* mice and their Ret! littermate 
controls (7 = 8). Data are representative of 3-4 independent experiments. 
Error bars show s.e.m. ns, not significant. *P < 0.05; **P < 0.01. 
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Extended Data Figure 8 | GFL-expressing glial cells anatomically 
co-localize with ILC3. a, Intestine of Ret@'? mice. Green, RET/GEP; 
red, GFAP; blue, ROR4t. Similar results were obtained in 3 independent 
experiments. b, Purified lamina propria LTi, NCR~ and NCR* ILC3 
subsets, T cells (T), B cells (B), dendritic cells (Dc), macrophages (Mg), 
enteric neurons (N) and mucosal glial cells (G). c, Neurosphere-derived 
glial cells. d, Activation of neurosphere-derived glial cells with TLR2 


(Pam3CSK4), TLR3 (Poli I:C), TLR4 (LPS) and TLR9 (DsDNA-EC) 
ligands, as well as IL-1, IL-18 and IL-33 (n=6). M, medium. e, [/22 in 
co-cultures of glial and ILC3 using single or combined GFL antagonists 
(n=6). f, 1122 in co-cultures of ILC3 and glial cells from II1 b~'~ or their 
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Extended Data Figure 9 | Glial cell sensing via MYD838 signals. e-h, Citrobacter rodentium infection of Gfap-CreMyd88* mice and 

a-c, Intestinal glial cells were purified by flow cytometry. a, Germ- their littermate controls (Myd88") (n = 6). e, Innate IL-22. f, Citrobacter 
free and their respective SPF controls (n= 3). b, Myds8~/ ~ and their rodentium translocation. g, Infection burden. h, Weight loss. Data are 
respective wild-type littermate controls (n = 3). c, Gfap-CreMyd88° representative of 3 independent experiments. Error bars show s.e.m. 
and their littermate controls (Myd88") (n=3). d, Total lamina propria *P< 0.05; **P<0.01. 


cells of Gfap-CreMyd88°“ and their littermate controls (Myd88") (n =6). 
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Extended Data Figure 10 | A novel glial-ILC3-epithelial cell unit phosphorylation. GFL induced innate IL-22 acts on epithelial cells to 
orchestrated by neurotrophic factors. Lamina propria glial cells sense induce reactivity gene expression (CBP, commensal bacterial products; 
microenvironmental products that control neurotrophic factor expression. | AMP, antimicrobial peptides; Muc, mucins). Thus, neurotrophic factors 
Glial-derived neurotrophic factors operate in an ILC3-intrinsic manner are the molecular link between glial cell sensing, innate IL-22 production 
by activating the tyrosine kinase RET, which directly regulates innate and intestinal epithelial barrier defence. 


IL-22 downstream of a p38 MAPK/ERK-AKT cascade and STAT3 
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The mechanism of RNA 5’ capping with NAD‘, 
NADH and desphospho-CoA 


Jeremy G. Bird!**, Yu Zhang**, Yuan Tian?, Natalya Panova‘, Ivan Barvik°, Landon Greene’, Min Liu’, Brian Buckley’, 
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The chemical nature of the 5’ end of RNA is a key determinant of 
RNA stability, processing, localization and translation efficiency”, 
and has been proposed to provide a layer of ‘epitranscriptomic’ 
gene regulation>. Recently it has been shown that some bacterial 
RNA species carry a 5’-end structure reminiscent of the 5’ 
7-methylguanylate ‘cap’ in eukaryotic RNA. In particular, RNA 
species containing a 5’-end nicotinamide adenine dinucleotide 
(NAD*) or 3’-desphospho-coenzyme A (dpCoA) have been 
identified in both Gram-negative and Gram-positive bacteria*. It 
has been proposed that NAD*, reduced NAD* (NADH) and dpCoA 
caps are added to RNA after transcription initiation, in a manner 
analogous to the addition of 7-methylguanylate caps®®. Here we 
show instead that NADt, NADH and dpCoA are incorporated into 
RNA during transcription initiation, by serving as non-canonical 
initiating nucleotides (NCINs) for de novo transcription initiation 
by cellular RNA polymerase (RNAP). We further show that both 
bacterial RNAP and eukaryotic RNAP II incorporate NCIN caps, 
that promoter DNA sequences at and upstream of the transcription 
start site determine the efficiency of NCIN capping, that NCIN 
capping occurs in vivo, and that NCIN capping has functional 
consequences. We report crystal structures of transcription 
initiation complexes containing NCIN-capped RNA products. Our 
results define the mechanism and structural basis of NCIN capping, 
and suggest that NCIN-mediated ‘ab initio capping’ may occur in 
all organisms. 

NAD?t, NADH, and dpCoA are nucleotides that share an adenosine- 
diphosphate substructure with ATP (Fig. la, red). Therefore, in principle, 
NAD*, NADH, and dpCoA could serve in place of ATP as initiating 
nucleotides in de novo transcription initiation by a cellular RNAP, 
and thereby be incorporated ab initio into RNA (see Supplementary 
Discussion). To assess whether NAD*, NADH, and dpCoA serve 
as initiating nucleotides for a cellular RNAP, we performed in vitro 
transcription experiments with Escherichia coli RNAP and DNA tem- 
plates containing promoters for RNAs identified as the most-highly 
NAD*-capped in vivo® (Peady and Pynas Fig. 1b; Extended Data Fig. 1b). 
We performed reactions using ATP, NAD*, NADH, or dpCoA as 
initiating nucleotide and [«**P]-CTP as extending nucleotide. In 
each case, we observed efficient formation of an initial RNA product 
(Fig. 1b; Extended Data Fig. 1b). RppH, which processes 5’-triphosphate 
and 5’-diphosphate RNAs to 5’-monophosphate RNAs”, processed the 
product obtained with ATP, but had no effect on products obtained with 
NAD*, NADH, and dpCoA (Fig. 1b; Extended Data Fig. 1b). Conversely, 
NudC, which processes 5’-capped RNAs to 5'‘-monophosphate 
RNASs*%, had no effect on the product formed with ATP, but processed 
products obtained with NAD*, NADH, and dpCoA (Fig. 1b; Extended 
Data Fig. 1b). Liquid chromatography/tandem mass spectrometry 


analysis confirmed the detection and structural assignment of the prod- 
uct of NAD*-mediated transcription initiation (Extended Data Fig. 2). 

To assess whether initial RNA products formed with NCINs can 
be extended to full-length RNA products, we performed transcription 
reactions with ATP, NAD*, NADH, or dpCoA as initiating nucleo- 
tide and [a**P]-UTP, CTP, and GTP as extending nucleotides on 
templates containing P,,,; linked to a 100-nucleotide cassette that did 
not contain any adenine bases. We observed production of full-length 
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Figure 1 | De novo transcription initiation by ATP and NCINs. 

a, Structures of ATP, NAD*, NADH, and dpCoA. Red, identical atoms. 
b, Initial RNA products of in vitro transcription reactions with ATP, 
NAD*, NADH, or dpCoA as initiating nucleotide and [«**P]-CTP as 
extending nucleotide (E. coli RNAP; Pyaay). Products were treated with 
RppH (processes 5’-triphosphate RNA to 5’‘-monophosphate RNA and 
5’-NTP to 5/-NDP/5’/-NMP*!*) or NudC (processes 5/-NADt/NADH- 
capped RNA to 5’-monophosphate RNA‘) as indicated. c, d, Full-length 
RNA products of in vitro transcription reactions with ATP, NAD*, 
NADH, or dpCoA as initiating nucleotide and [a**P]-CTP, GTP, and 
UTP as extending nucleotides (c), or with [°*P]-ATP or [a*’P]-NAD* as 
initiating nucleotide and CTP, GTP, and UTP as extending nucleotides 
(d) (E. coli RNAP; P;nar fused to an A-less cassette ). M, 100-nucleotide 
marker. For gel source data, see Supplementary Fig. 1. 
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Figure 2 | Promoter-sequence effects on efficiency of NCIN-mediated 
transcription initiation. a, NCIN capping requires A,;. Top, promoters 
of NAD*-capped RNAs (promoter elements and start sites in grey). 
Bottom, initial RNA products of in vitro transcription reactions with 
ATP, NAD*, NADH, or dpCoA as initiating nucleotide and [a*?P]-CTP 
as extending nucleotide (E. coli RNAP; WT (+1A), Prnar (upper) or Peaay 
(lower); Mut. (+1G), +1G derivative of P;naz (upper) or Pgaay (lower)). 

b, Promoter sequence determinants in addition to A, affect NCIN 
capping. Top, control +1A promoters. Bottom left, dependence of NADT 
capping on [NAD*]/[ATP] ratio (mean £s.e.m. of four determinations). 
Bottom right, relative efficiencies of NAD* capping. c, Promoter position 
—1 affects NCIN capping. Top, Prnai—ic) (—1 in black). Other features 

as in b (mean + s.e.m. of three determinations). For gel source data, see 
Supplementary Fig. 1. 


products in all cases, and observed that full-length products with 
NAD*, NADH, and dpCoA contained NudC-sensitive, capped 5’/-ends 
(Fig. 1c). Equivalent results were obtained using [?°P]-NAD*, rather 
than [a*?P]-UTP, to detect products (Fig. 1d; Extended Data Fig. 3). 
Because NAD*, NADH, and dpCoA have the same Watson-Crick 
base pairing preferences as ATP (Fig. 1a), NCIN-mediated initiation 
should occur only for RNAs transcribed from promoters that contain 
A:T at the transcription start site +1 (-++1A promoters). Consistent with 
this inference, we find that NCIN-mediated initiation occurs only with 
+1A promoters, and not with corresponding +1G promoter deriva- 
tives (Fig. 2a). 
In principle, other aspects of promoter sequence, in addition to 
+1A at the transcription start site, could affect NCIN-mediated initi- 
ation. Consistent with this inference, we find that relative efficiencies 
of NAD*-mediated initiation and ATP-mediated initiation, (Keat/Km, 
NAD*)/(kcat/Km ATP), differ over one to two orders of magnitude at 
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Figure 3 | NCIN-mediated transcription initiation in vivo. a, Templates 
having rnal, T7A1, and rnaI(-1C) promoters fused to identical transcribed 
regions (promoter elements, start sites, and position of RNA 3’-end 

in grey; DNA that directs synthesis of reference RNA in blue; site for 
MazF-mt3 endoribonuclease used to generate RNA products having 
uniform RNA 3/-ends, underlined). b, NCIN capping in vitro (left; 1 mM 
NAD*, 2001.M ATP, CTP, UTP, and GTP) and in vivo (right; RNA isolated 
from cells). RNA products were treated with MazF-mt3 or MazF-mt3 plus 
NudC, and detected by hybridization. M, 40-nucleotide and 50-nucleotide 
markers. c, Effects of NCIN capping on RNA stability in vivo (rnal 
template; left, exponential-phase cells; right, stationary-phase cells; times, 
minutes after addition of RNA-synthesis inhibitor rifampin; half-life 
values are the mean + s.e.m. of three determinations for exponential phase 
and five determinations for stationary phase). For gel source data, see 
Supplementary Fig. 1. 


different +1A promoters, with P,ng and Pgagy exhibiting high rela- 
tive efficiencies and Py25 and P 774; promoters exhibiting low relative 
efficiencies (Figs 2a, b; Extended Data Figs 4 and 5). Substitution of 
the Pax promoter position immediately upstream of the transcription 
start site, —1, results in an order-of-magnitude reduction in relative 
efficiency (Fig. 2c), indicating that position —1 is a key determinant of 
NAD*-mediated initiation. 

Having obtained results establishing that NAD‘, NADH, and dpCoA 
function as NCINs for transcription initiation in vitro and that pro- 
moter sequence determines the efficiency of NCIN-mediated initiation 
in vitro, we hypothesized that NCIN-mediated initiation also may occur 
in vivo and be responsible for the generation of NAD*-, NADH-, and 
dpCoA-capped RNAs in vivo. Consistent with this hypothesis, the pro- 
moters for full-length RNAs identified as enriched for NADt-capping 
in vivo all are +1A promoters®, the class of promoters competent for 
NAD*-mediated initiation in vitro (Fig. 2a). Further consistent with 
this hypothesis, the promoters identified as most-highly enriched for 
NAD* capping in vivo are Png; and Pudi the two promoters with 
highest efficiencies of NAD* -mediated initiation in vitro (Fig. 2b). 

To show directly that NCIN-mediated initiation occurs in vivo, 
we assessed whether the promoter sequence dependence for NCIN- 
mediated initiation observed in vitro is observed in vivo. We fused pro- 
moters observed in vitro to have high (P naz) and low (P 77415 Prna(—10)) 
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efficiencies of NAD*-mediated initiation (Figs 2b, c) to transcribed 
regions directing synthesis of the same RNA (reference RNA; Fig. 3a) 
and compared yields of NudC-sensitive, capped RNA products (Fig. 3b). 
We found that the promoter with high efficiency of NAD*-mediated 
initiation in vitro yielded high levels of NudC-sensitive products in vivo 
(P:nab Fig. 3b), whereas the promoters with low efficiency of NAD*- 
mediated initiation in vitro yielded low, undetectable, levels of NudC- 
sensitive products in vivo (P7741; Prna(—ic); Fig. 3b). We conclude that 
NCIN-mediated initiation occurs in vivo, and that most, or all, NCIN- 
capped RNA products in vivo arise from NCIN-mediated initiation. 
To assess whether NCIN capping has functional consequences 
in vivo, we measured stabilities of NCIN capped and uncapped RNA 
in vivo, using a ANudC strain to eliminate NudC processing. We 
observe that NCIN capping results in a large, approximately three- to 
fourfold, increase in RNA stability (Fig. 3c), demonstrating a functional 
consequence of NCIN capping in vivo. We further observe that levels 
of NCIN capping are approximately twofold higher in stationary phase 
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Figure 4 | Structural basis of NCIN-mediated 
transcription initiation. a—c, Crystal structures of 
RPo-pppApC, RPo-NAD*pC, and RPo-dpCoApC. 
Left, electron density and atomic model for initial 
RNA product. Green mesh, F, — F, omit map 
(contoured at 2.50 in a, b and 2.2 in c); red, DNA; 
pink, RNA product and diphosphate in E site’ (see 
refs 15-17); violet spheres, Mg”*(I) and Mg** (II); 
grey, RNAP bridge helix (BH). Right, contacts 
between RPo and initial RNA product. Green and 
orange, carbon and phosphorus atoms derived 
from initiating nucleotide; pink, atoms derived 
from extending nucleotide; red, DNA atoms and 
non-DNA oxygen atoms; blue, nitrogen atoms; grey 
sticks, RNAP carbon atoms; violet sphere, Mg* (1); 
grey ribbon, RNAP bridge helix. 
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versus exponential phase (Fig. 3c), demonstrating a growth-phase 
dependence in NCIN capping. 

All cellular RNAPs are members of a protein family having conserved 
structures and mechanisms’?~””. To assess whether eukaryotic RNAP II, 
like bacterial RNAP, can use NCINS as initiating nucleotides, we per- 
formed in vitro transcription experiments with Saccharomyces cerevisiae 
RNAP II using ATP, NAD*, or NADH as initiating nucleotide and 
[a’P]-UTP as extending nucleotide (Extended Data Fig. 6). The results 
matched those for bacterial RNAP: initiation occurred with NAD* and 
NADH, and products were insensitive to RppH and sensitive to NudC. 
We conclude that eukaryotic RNAP II can initiate transcription with 
NAD? and NADH, and we suggest that ab initio capping with NAD* 
and NADH also may occur in eukaryotes. 

To define the structural basis of transcription initiation with ATP, 
NAD*, and dpCoA as initiating nucleotides we determined crys- 
tal structures of product complexes obtained by soaking crystals of 
Thermus thermophilus RNAP-promoter open complex", RPo, with 
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ATP and CTP, NAD? and CTP, and dpCoA and CTP (Fig. 4; Extended 
Data Fig. 7; Extended Data Table 1). The results indicate that, in each 
case, the initiating entity ATP, NAD*, or dpCoA—and an extending 
nucleotide (CTP) were able to react in crystallo to form an initial RNA 
product—pppApC, NAD*pC, or dpCoApC, respectively—and that 
RNAP was able to translocate in crystallo by one base pair relative to 
the nucleic-acid scaffold, yielding a complex in a post-translocated state 
poised for RNA extension (Fig. 4). The results for ATP provide the first 
structural description of an initial product complex for a cellular RNAP. 
The results for NAD* and dpCoA show graphically that NAD* and 
dpCoA serve as initiating nucleotides in transcription initiation and 
define the interactions that the initial RNA products make with DNA 
and RNAP (see Supplementary Discussion). 

Our results establish that NAD*, NADH, and dpCoA function as 
non-canonical initiating nucleotides (NCINs) for de novo transcription 
initiation and demonstrate the occurrence of ab initio, as opposed to 
post-transcription-initiation, capping of cellular RNAs. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and the investigators were not blinded to allocation during 
experiments and outcome assessment. 

Proteins. E. coli RNAP core enzyme was prepared from BL21(DE3) cells trans- 
formed with plasmid pVS10 (gift from I. Artsimovitch) as described in ref. 18. 
o”° was prepared from BL21(DE3) cells transformed with plasmid po”°-His (gift 
from J. Roberts) as described in refs 19,20. RppH was purchased from NEB. 
NudC was purified from BL21(DE3) cells from pET NudC-His, which was 
prepared as described previously®. Briefly, nudC was amplified by PCR from 
E. coli strain MG1655 using oligonucleotides JB221 and JB222. The PCR product 
was digested with Xbal and NotI and inserted into pET28c digested with Xbal 
and NotI. T: thermophilus RNAP holoenzyme was prepared as described previ- 
ously!?, Saccharomyces cerevisiae RNAP II was prepared as described previously”). 
MazF-mt3 protein, prepared as described previously”, was a gift from N. Woychik. 
In vitro transcription assays with E. coli RNAP: linear templates. Linear tran- 
scription templates were synthesized by PCR using Phusion HF Polymerase master 
mix (Thermo) and oligonucleotides listed in Supplementary Table 1. PCR reac- 
tions contained 5 nM of the indicated template oligonucleotide, and 0.5 1M of the 
indicated forward and reverse primers. PCR products were purified using Qiagen 
PCR-clean up columns before use in transcription reactions. 

Sequence of gadY (—65 to +35) template used in Figs 1b and 2a, b, and 
Extended Data Figs 4 and 5: 
AGCGTATAGCTTATGTTTATAAAAAAATGGCTGATCTTATTTCCAGTAA 
AAGTTATATTTAACTTACTGAGAGCACAAAGTTTCCCGTGCCAACAGG 
GAG (prepared using oligos JB224, JB230, JB231). 

Sequence of gadY (+1G) (—65 to +35) template used in Fig. 2a: 
AGCGTATAGCTTATGTTTATAAAAAAATGGCTGATCTTATTTCCAGTAA 
AAGTTATATTTAACTTGCTGAGAGCACAAAGTTTCCCGTGCCAACAGG 
GAG (prepared using oligos JB244, JB230, JB231). 

Sequence of rnaI (—65 to +35) template used in Figs 2a—c and Extended Data 
Figs 1b, 4, and 5: 
CGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTA 
CGGCTACACTAGAAGAACTGTATTTGGTATCTGCGCTCTGCTGAAGCC 
AGTT (prepared using oligos JB288, JB281, JB269). 

Sequence of mutant rnalI (+1G) (—65 to +35) template used in Fig. 2a: 
CGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTA 
CGGCTACACTAGAAGAGCTGTATTTGGTATCTGCGCTCTGCTGAAGCC 
AGTT (prepared using oligos JB287, JB281, JB269). 

Sequence of mutant rnaI (—1C) (—65 to +35) template used in Fig. 2c: 
CGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTA 
CGGCTACACTAGAAGCACTGTATTTGGTATCTGCGCTCTGCTGAAGCC 
AGTT (prepared using oligos JB367, JB281, JB269). 

Sequence of rnal (—65 to +112) A-less cassette template used in Figs 1c, d and 
Extended Data Fig. 3: 
CGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTA 
CGGCTACACTAGAAGAACTTGTTTTGGTGTCTGCGCTCCTCCTTGCCT 
GTTTCCTCGGTTCTTTGTGTTGGTTGCTCTGTGTTCCTTCGTTTTTCC 
GCCCTGCTTGGCGGTTTTTTCGTTTTCTGTGC (prepared using oligos 
JB288, JB281, JB251). 

Sequence of N25 (—65 to +35) template used in Fig. 2b and Extended Data 
Figs 4 and 5: 
ATCCGTCGAGGGAAATCATAAAAAATTTATTTGCTTTCAGGAAAATTT 
TTCTGTATAATAGATTCACTAATTTGAGAGAGGAGTTTAAATATGGCTG 
GTT (prepared using oligos JB232, JB233, JB234). 

Sequence of T7A1 (—65 to +35) template used in Fig. 2b and Extended Data 
Figs 4 and 5: 
GATTAATTTAAAATTTATCAAAAAGAGTATTGACTTAAAGTCTAACCTA 
TAGGATACTTACAGCCACTGAGAGGGACACGGCGAATAGCCATCCCAAC 
GA (prepared using oligos JB235, JB228, JB229). 

Promoter —35 elements, —10 elements and transcription start sites are in bold 
and underlined. 

In vitro transcription assays with E. coli RNAP: abortive initiation assays. 
10nM of linear template was mixed with 50nM core RNAP and 250nM o” in 
60,11 transcription buffer (10 mM Tris-HCl pH 8.0, 40mM KCl, 10mM MgCh, 
0.1mM EDTA, 1mM DTT, 0.1 mg ml~! BSA, 2% glycerol) and incubated at 
37°C for 15 min to form open complexes. The indicated initiating nucleotide 
(0.2mM ATP (Roche), 1mM NAD* (Roche), 1mM NADH (Roche) or 1mM 
dpCoA (Sigma-Aldrich)) was added along with 6 j.Ci [a**P]-CTP (Perkin Elmer; 
3,000 Cimmol™!) and reactions were incubated at 37°C for 10 min to allow for 
product formation. For the NAD*/ATP competition assays shown in Fig. 2b, c, 
1mM NAD* was present in each reaction along with increasing concentrations of 
ATP. The data in Extended Data Fig. 4b shows initial products formed in reactions 


with 1mM NAD* and 50,.M ATP. For the NADH/ATP competition assays shown 
in Extended Data Fig. 5a, 1 mM NADH was present in each reaction along with 
increasing concentrations of ATP. For the dpCoA/ATP competition assays shown 
in Extended Data Fig. 5b, 1mM dpCoA was present in each reaction along with 
increasing concentrations of ATP. 

For experiments shown in Fig. 2a—c and Extended Data Figs 4 and 5, reactions 
were stopped by addition of an equal volume of gel loading buffer (90% formamide, 
100 mM Tris-HCl, pH 8.0, 18 mM EDTA, 0.025% xylene cyanol, 0.025% bromo- 
phenol blue). For the reactions shown in Fig. 1b and Extended Data Fig. 1b the 
samples were passed through a Nanosep Centrifugal Device (10 kDa molecular 
weight cut-off; Pall Corporation), the flow-through was collected, mixed with 1 U 
RppH (NEB), 400nM NudC or NudC storage buffer (50 mM Tris-HCl, pH 7.9, 
0.5M NaCl, 50mM KCl, 1mM MgCh, 7mM TCEP, 50% glycerol), incubated at 
37°C for 30 min, and mixed with an equal volume of gel loading buffer. Samples 
were run on 15% TBE-urea polyacrylamide gels (UreaGel system, National 
Diagnostics). Autoradiography of gels was performed using storage phosphor 
screens and a Typhoon 9400 variable mode imager (GE Life Science) and quan- 
tified using ImageQuant software. We note that the presence of contaminating 
AMP in the dpCoA stock (Extended Data Fig. 8) leads to the generation of pApC 
products in reactions performed with dpCoA. 

For the competition assays in Fig. 2b, c and Extended Data Fig. 5, (NCIN) 
pC and pppApC products were quantified, the fraction of (NCIN)pC products 
was plotted vs. [NCIN]/[ATP], and the value of (keat/KmNCIN)/(kcat/Kms ATP) 
was calculated as the value of [NCIN]/[ATP] for which the fraction of (NCIN)pC 
products was 0.5. Values were obtained from averaging results of three (Fig. 2c, 
Extended Data Fig. 5) or four (Fig. 2b) technical replicates. 

In vitro transcription assays with E. coli RNAP: run-off transcription assays. 
For reactions shown in Fig. 1c, open complexes formed as described above were 
mixed with the indicated initiating nucleotide (0.2 mM ATP, 1!mM NAD‘, 1mM 
NADH or 1mM dpCoA) along with 6 {Ci [a**P]-CTP (Perkin Elmer; 3,000 
Cimmol~!), 200\1M CTP, 200|1M UTP, and 200|1M GTP, and reactions were 
incubated at 37°C for 10 min to allow for product formation. For the reactions 
shown in Fig. 1d and Extended Data Fig. 4, open complexes formed as described 
above were mixed with 10.Ci [~?P]-ATP (Perkin Elmer; 6,000 Cimmol7?), or 
20 Ci [a??P]-NAD* (Perkin Elmer; 800 Cimmol~') along with 200|.M CTP, 
200,.M UTP and 200\.M GTP. Reactions were incubated at 37°C for 10 min 
to allow for product formation, and stopped by addition of 100,11 stop solution 
(0.6 M Tris-HCl, pH 8.0, 18mM EDTA, 0.1 mg ml“! glycogen). Samples were 
extracted with acid phenol:chloroform and RNA transcripts were recovered by 
ethanol precipitation, resuspended in transcription buffer and divided into 10 il 
aliquots. For the experiment in Fig. 1d, 400nM NudC was added to one aliquot 
while NudC storage buffer was added to the other for mock treatment. For the 
experiment in Extended Data Fig. 3, 0.25 U alkaline phosphatase (Thermo Fisher) 
was added to one aliquot, 400 nM NudC was added to one aliquot, both 0.25 U 
alkaline phosphatase and 400 nM NudC was added to one aliquot, and NudC 
storage buffer was added to another aliquot for mock treatment. Reactions were 
incubated at 37°C for 30 min and stopped by addition of an equal volume of gel 
loading buffer and analysed by gel electrophoresis on 15% TBE-urea polyacryla- 
mide gels as described above. Reactions were loaded alongside an Ambion Decade 
Marker System (Life Technologies) to enable an estimation of the approximate 
size of the full-length transcripts generated. 

In vitro transcription assays with E. coli RNAP: transcription assays with neg- 
atively supercoiled DNA templates. Plasmid pJB89 contains sequence for the 
Pnai-reference RNA fusion inserted between the Xbal and Sall sites of plasmid 
pSG289’, The sequence of the insert is: 
TCTAGACGAGGTATGTAGGCGGTGCTACAGAGT TCT TGAAGTGGTGGC 
CTAACTACGGCTACACTAGAAGAACAGTATTTGGTATCTGCGCTCTGCA 
CGATGGGTTAATTCGTTCCTTGGATCCGAATAGCCATCCCAATCGAAC 
AGGCCTGCTGGTAATCGCAGGCCTTTTTATTTGGATGTCGAC. 

Plasmid pJB91 contains sequence for the Pr74)-reference RNA fusion 
inserted between the Xbal and Sall sites of plasmid pSG289*°. The sequence 
of the insert is: 
TCTAGAGATTAATTTAAAATTTATCAAAAAGAGTATTGACTTAAAGTCTA 
ACCTATAGGATACTTACAGCCACAGTATTTGGTATCTGCGCTCTGCACG 
ATGGGTTAATTCGTTCCTTGGATCCGAATAGCCATCCCAATCGAACAGG 
CCTGCTGGTAATCGCAGGCCTTTTTATTTGGATGTCGAC, 

Plasmid pJB95 contains sequence for the Pynay—ic)-reference RNA fusion 
inserted between the Xbal and Sall sites of plasmid pSG289”*. The sequence of 
the insert is: 
TCTAGACGAGGTATGTAGGCGGTGCTACAGAGT TCT TGAAGTGGTGGC 
CTAACTACGGCTACACTAGAAGCACAGTATTTGGTATCTGCGCTCTGCA 
CGATGGGTTAATTCGTTCCTTGGATCCGAATAGCCATCCCAATCGAACAG 
GCCTGCTGGTAATCGCAGGCCTTTTTATTTGGATGTCGAC. 
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Promoter —35 elements and —10 elements are in bold and underlined. DNA 
that directs synthesis of the reference RNA is coloured blue. Recognition site for 
MazF-mt3 (UCCUU; ref. 22) is underlined. Reference RNA includes sequence of 
the tR2 terminator. 

For in vitro reactions shown in Fig. 3b, open complexes were formed using 
10nM of plasmid (pJB89, pJB91, or pJB95) and 50nM core RNAP and 250nM 
o”° in 304 transcription buffer (see above). Transcription was carried out for 
10 min at 37°C using 1 mM NAD‘, 200 uM ATP, UTP, CTP and GTP. Reactions 
were stopped by the addition of 120 1 of stop solution and samples were extracted 
with acid phenol:chloroform and ethanol precipitated. The resulting RNAs were 
resuspended in 10mM Tris, pH 8.0, supplemented with 2 1g of RNA isolated from 
wild-type MG1655 cells and treated with 2 1M MazF-mt3 for 1h at 37°C. Reactions 
were divided into two, 9 j1l aliquots and 1 jul 10x transcription buffer was added 
to each. 1j1M NudC was added to one of these aliquots and NudC storage buffer 
was added to the other. Reactions were incubated at 37°C for 90 min and stopped 
by addition of an equal volume of gel loading buffer. 

Analysis of RNA products generated in vivo. For the experiments of Fig. 3b, 
E. coli MG1655 cells containing plasmids pJB89, pJB91, or pJB95 were shaken 
at 220 1r.p.m. at 37°C in 25 ml LB (10g Bacto-tryptone, 5 g Bacto-yeast extract 
and 10g NaCl per litre) containing 251g ml’ chloramphenicol in 125 ml flasks 
(Bellco). When cell density reached an OD¢09 ~0.6, 2 ml of the cell suspension 
was centrifuged (30s, 10,000g at room temperature) to collect cells, supernatants 
were removed, and cell pellets were rapidly frozen on dry ice and stored at -80°C. 

For the RNA stability experiments of Fig. 3c, strain MG1655 AnudC::Kan was 
constructed by P1 transduction of the AnudC::Kan cassette from JW5548” into 
MG1655. MG1655 AnudC::Kan cells containing plasmids pJB89, pJB91, or pJB95 
were grown as described above to an OD¢09 ~0.6 (exponential phase) or an OD¢00 
~3.5 (stationary phase). 2 ml of the cell suspension was collected as described 
above (the RNA derived from these cells was used as the 0 min time point). 
Rifampicin was added to a final concentration of 2mg ml! to the remainder of the 
culture. 2-ml aliquots were taken at the indicated times and cells were collected as 
described above. Frozen pellets were resuspended in 1 ml of TRI Reagent solution 
(Molecular Research Center). Samples were incubated at 70°C for 10 min and then 
centrifuged (10 min, 21,000g, 4°C) to remove insoluble material. The supernatant 
was transferred to a fresh tube and 20011 of chloroform was added, samples were 
mixed by vortexing, and then centrifuged (10 min, 21,000g, 4°C). The aqueous 
phase was transferred to a fresh tube, extracted with acid phenol:chloroform, 
and RNA transcripts were recovered by ethanol precipitation and resuspended 
in 10mM Tris, pH 8.0. 

5 ug of isolated RNA was treated with 21M MazF-mt3 in 10mM Tris, pH 8.0 
(total volume, 25 11) and incubated for 60 min at 37 °C. Reactions were stopped by 
addition of an equal volume of gel loading buffer. In cases where RNA products 
were treated with NudC, reactions were divided into into two aliquots and 10x 
transcription buffer was added to each to a final concentration of 1x. 11M NudC 
was added to one of these aliquots and NudC storage buffer was added to the other. 
Reactions were incubated at 37 °C for 45 min and stopped by addition of an equal 
volume of gel loading buffer. 

Detection of RNA products generated in vitro or in vivo by hybridization. In 
Fig. 3b, c, RNA products were detected by a procedure comprising electrophoresis 
on 10% 7.5 M urea slab gels (equilibrated and run in 1x TBE), transfer to and UV 
cross-linking to a membrane (Nytran supercharge nylon membrane, GE Healthcare 
Life Sciences), hybridization with a 32p_labelled ‘locked-nucleic-acid’ probe com- 
plementary to positions +31 to +42 of the reference RNA (sequence, 
agCaaAttAacCc; LNA bases capitalized; purchased from Exiquon; 32P_labelled 
using T4 polynucleotide kinase), high stringency washing (procedure as described 
in ref. 25), and storage-phosphor imaging. Bands were quantified using 
ImageQuant software. The percentage of RNA products in the upper band was 


upperband 


determined for each sample as: 100 x Values of the per cent 


‘(upper band + lower band)’ 
of NudC-sensitive products represent the reduction in the per cent of RNA prod- 
ucts in the upper band upon treatment with NudC. Values shown represent the 
average of three technical replicates (in vitro) or three biological replicates (in vivo). 
For the experiments in Fig. 3c, values of half-life were determined by fitting data 
to a single-exponential decay function (mean +s.e.m. of three biological replicates 
for exponential phase and five biological replicates for stationary phase). 
In vitro transcription assays with E. coli RNAP: liquid chromatography/tandem 
mass spectrometry (LC/MS/MS) detection of NAD* pC. Open complexes formed 
as described above were mixed with 2mM NAD* and 10 1M CTP, incubated at 
37°C for 1h, passed through a Nanosep Centrifugal Device, and the flow-through 
was analysed by LC/MS/MS. (Control samples in which NAD*, CTP, or RNAP 
were not present in the reactions were also prepared.) LC/MS/MS analyses were 
performed in negative ion mode with a Finnigan LTQ mass spectrometer equipped 
with an electrospray ionization (ESI) interface coupled with a Finnigan Surveyor 
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HPLC system. The flow rate from the Finnigan Surveyor pump was 0.3 ml min~!. 


The autosampler temperature was 5°C. The liquid chromatography method used 
a YMC ODS-A 51m particle size 120A pore size column, 3.0mm x 100mm. 
The samples were separated using a gradient mobile phase consisting of 5mM 
ammonium formate buffered to pH 7.9 in water (A) and methanol (B). The gradi- 
ent condition was: 0-5 min, 100% A; 5-15 min, 100-30% A; 15-23 min, 30-0% A; 
23-28 min, 0% A; 28-40 min, 100% A. Column temperature was 25°C. The 
temperature of the heated capillary was 200°C. Fragmentation was activated by 
collision induced dissociation (CID) with a collision energy of 20-25%. The search 
for NAD*pC was conducted by isolating its anion ((M-2H), m/z 967) at isolation 
width 3 using full scan mode, and analysing its fragmentation spectrum (collision 
energy 20%) from m/z=210 to m/z= 1,000. The characteristic fragment m/z= 845 
(loss of nicotinamide) was used to identify NAD* pC. The instrument control, data 
acquisition, and data analysis were performed by Xcalibur software. Ammonium 
formate (99%), water (HPLC grade) and methanol (HPLC grade) used in these 
LC/MS/MS experiments were purchased from Sigma-Aldrich. 

In vitro transcription assays with Saccharomyces cerevisiae RNAP II. Bubble 
templates for initiation were used following the approach in ref. 26 by mixing 
2.5 nmol of oligonucleotide CKO1639 and 2.5 nmol of oligonucleotide CKO1621 
(see Supplementary Table 1) in 100,11 of annealing buffer (40 mM Tris, pH 8.0, 
50mM NaCl, 1mM EDTA), incubating at 95°C for 3 min, and slowly cooling 
to 23°C (1°C min“) to allow annealing. The oligonucleotide mixture was run 
on a 2.5% agarose gel (Apex, Genesee) equilibrated and run in 1x TBE (90 mM 
Tris base (BioRad), 90 mM boric acid (Calbiochem), 2mM EDTA (JT Baker)), 
the gel was stained with ethidium bromide and the band corresponding to the 
bubble template was excised from the gel, crushed, and incubated with 500 il 
elution buffer (500 mM NH4OAc, 10mM Mg(OAc)>, 1 mM EDTA, 0.1% SDS). 
Gel debris was removed using a Spin-X column (Costar) and nucleic acids 
were isolated by ethanol precipitation and re-suspended in nuclease-free water 
(Ambion) to a concentration of 0.5,.g jl ~!. 500 ng of the template was mixed 
with M280 magnetic streptavidin beads (Invitrogen) in 1x TE (from Ambion 
components) supplemented with 1 M NaCl (EMD/Calbiochem) and incubated at 
25°C for 30 min to allow binding. Templates bound to the beads were washed 
three times using magnetic capture to separate beads in 1x TE supplemented 
with 1 M NaCl, once with 1x TE, and then equilibrated into 1x transcription 
buffer (TB: 20mM Tris, pH 8.0 (Ambion), 40 mM KCl (Ambion), 10 mM MgCl, 
(Ambion), 10% glycerol (Macron/Avantor), 0.25 mg ml! BSA (Ambion), 1 mM 
DTT (Gold Biotechnology)). 

2.5,g purified Saccharomyces cerevisiae RNAP II was added to each template 
in 501 1x TB, and incubated with rotation at 25°C for 20 min. Complexes were 
washed following magnetic capture twice with 20011 1x TB, and resuspended in 
22,1 2x TB containing 0.4U pl! RNase Inhibitor (NEB). 7.511 of this mixture 
was aliquoted for reactions (four reactions per bubble template). 

5 1] of ‘initiator’/ [a*P]-UTP mixes were added to give final concentrations of: 
1mM ATP (GE Lifesciences), 2.5}1.M [a?*P]-UTP (from 800 Cimmol™! (12.5 41M), 
Perkin Elmer); 1mM NAD*, 2.5.M [a*P]-UTP (from 800 Cimmol7! (12.5 1M), 
Perkin Elmer); or 1mM NADH, 2.51M [a**P]-UTP (from 800 Cimmol7! 
(12.511M), Perkin Elmer). Reactions were incubated at 25°C for 2h then passed 
through a Nanosep Centrifugal Device (10 kDa molecular weight cut-off). The flow 
through was collected, mixed with 0.511 (2.5 U) RppH (NEB), 0.541 NudC (400 nM 
final) or 0.511 NudC storage buffer, incubated at 37°C for 30 min, and mixed 
with an equal volume of urea gel loading buffer (10 M urea (Calbiochem), 0.5 M 
EDTA, pH 8.0 (JT Baker), 1 x TBE). Samples were run on 25% 19:1 acrylamide: 
bisacrylamide gel (from 40% 19:1 solution (BioRad), 7M urea (Calbiochem), 
1x TBE (90 mM Tris base; BioRad), 90 mM boric acid (Calbiochem), 2mM EDTA 
(JT Baker)). Autoradiography of gels was performed using storage phosphor 
screens and a Pharos (Bio-Rad) imager. Radiolabelled products were visualized 
using ImageLab (Bio-Rad) software. 

Structure determination: RPo-pppApC. Crystals of T. thermophilus RPo were 
prepared using a derivative of the consensus bacterial promoter nucleic acid scaf- 
fold used for analysis of RPo in ref. 13 having A in place of C at position —1 of 
the DNA template strand, and were grown and handled essentially as described!’ 
(T. thermophilus RNAP is a bacterial RNAP that is homologous in structure, 
sequence, and mechanism to E. coli RNAP, but, empirically, yielding crystals of 
RNAP that diffract to higher resolution than E. coli RNAP!?”’.) Crystallization 
drops contained 111 RPo in 20 mM Tris-HCl, pH 7.7, 100 mM NaCl, and 1% 
glycerol, and 11] reservoir buffer (RB; 100 mM Tris-HCl, pH 8.4, 200 mM KCI, 
50mM MgCh, and 9.5% PEG4000), and were equilibrated against 400,11 RB in 
a vapour-diffusion hanging-drop tray. Rod-like crystals appeared in 1 day, and 
were used to micro-seed hanging drops using the same conditions. ATP and CTP 
(Sigma-Aldrich) were soaked into RPo crystals by addition of 0.2 jl 40 mM ATP 
and 40 mM CTP in 60% (v/v) RB to the crystallization drop, and incubated for 
2min at 22°C. Crystals were transferred into reservoir solutions containing 2mM 
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ATP, 2mM CTP, and 17.5% (v/v) (2R, 3R)-(-)-2,3-butanediol (Sigma-Aldrich) and 
were flash-cooled with liquid nitrogen. 

Diffraction data were collected at BNL beamline X29A (temperature, 100 K; 

wavelength, 1.08 A), processed and scaled using HKL2000”*. Structure factors 
were converted using the French-Wilson algorithm in Phenix” and were subjected 
to anisotropy correction using the UCLA MBI Diffraction Anisotropy server*” 
(http://services.mbi.ucla.edu/anisoscale/). The structure was solved by molecu- 
lar replacement with Molrep*! using one RNAP molecule from the structure of 
T. thermophilus RPo (PDB 4G7H)'¥ as the search model. Early-stage refinement 
included rigid-body refinement of each RNAP molecule, followed by rigid-body 
refinement of each subunit of RNAP molecules. Cycles of iterative model building 
with Coot* and refinement with Phenix’ were performed. Atomic models of the 
DNA nontemplate strand, the DNA template strand, and RNA were built into 
mF, — DF, omit maps, and subsequent cycles of refinement and model building 
were performed. The final crystallographic model of RPo-pppApcC, refined to 
Ramachandran statistics of 97.84% favoured, 2.10% allowed, and 0.06% outliers, 
has been deposited in the PDB with accession code 5D4C. 
Structure determination: RPo-NAD*pC. NAD* (Sigma-Aldrich) and CTP were 
soaked into RPo crystals (prepared as described above) by addition of 0.2 jl 30 mM 
NAD* and 40mM CTP in 50% (v/v) RB to the crystallization drop, and incu- 
bated for 2h at 22°C. Crystals were transferred into reservoir solutions containing 
1.5mM NAD*, 2mM CTP, and 17.5% (v/v) (2R, 3R)-(-)-2,3-butanediol and were 
flash-cooled with liquid nitrogen. 

Diffraction data were collected at APS beamline 19-ID-D (temperature, 100 K; 

wavelength, 0.98 A), processed and scaled using HKL2000°, and subjected to ani- 
sotropic correction using the UCLA MBI Diffraction Anisotropy server®” (http:// 
services.mbi.ucla.edu/anisoscale/). The structure was solved and refined using 
procedures analogous to those described above for RPo-pppApC. The final crys- 
tallographic model of RPo-NAD “pC, refined to Ramachandran statistics of 97.81% 
favoured, 2.15% allowed, and 0.04% outliers, has been deposited in the PDB with 
accession code 5D4D. 
Structure determination: RPo-dpCoApC. dpCoA (Sigma-Aldrich) and CTP were 
soaked into RPo crystals (prepared as described above) by addition of 0.2 11 60 mM 
3’-desphospho-CoA and 40 mM CTP in 50% (v/v) RB to the crystallization drop, 
and incubated for 2h at 22°C. Crystals were transferred into reservoir solutions 
containing 3 mM 3’/-desphospho-CoA, 2mM CTP, and 17.5% (v/v) (2R, 3R)-(-)- 
2,3-butanediol and were flash-cooled with liquid nitrogen. 

Diffraction data were collected at APS beamline 19-ID-D (temperature, 100 K; 
wavelength, 0.98 A), processed and scaled using HKL200078, and subjected to ani- 
sotropic correction using the UCLA MBI Diffraction Anisotropy server®” (http:// 
services.mbi.ucla.edu/anisoscale/). The structure was solved and refined using 
procedures analogous to those described above for RPo-pppApC. The final crys- 
tallographic model of dpCoApC, refined to Ramachandran statistics of 97.54% 
favoured, 2.43% allowed, and 0.03% outliers, has been deposited in the PDB with 
accession code 5D4E. 

Analysis of dpCoA by liquid chromatography (LC)-UV. AMP (Sigma-Aldrich) 
and dpCoA were diluted to 1 mM in water (HPLC grade) before injection. LC-UV 
analyses were performed using a Waters 2960 HPLC system coupled with a Waters 


2996 photodiode array detector. The flow rate from the Finnigan Surveyor pump 
was 0.3mlmin~!. The autosampler temperature was 5°C. The LC method used 
an YMC ODS-A 5,.m particle size 120 A pore size column, 3.0mm x 100mm. 
The samples were separated using a gradient mobile phase consisting 
of 25mM triethylammonium formate buffered to pH 3.0 in water (A) and 
methanol (B). The gradient condition was: 0-5 min, 100% A; 5-15 min, 
100-30% A; 15-23 min, 30-0% A; 23-28 min, 0% A; 28-40 min, 100% A. Column 
temperature was 25°C. Detector wave length was 260 nm. The instrument control, 
data acquisition, and data analysis were performed by MassLynx software. 
Triethylammonium acetate buffer (2.0 M) and formic acid (>95%) were purchased 
from Sigma-Aldrich. 
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Extended Data Figure 1 | De novo transcription initiation by ATP and 
NCINs. a, Structures of ATP, NAD*, NADH, and dpCoA. Red, identical 
atoms. b, Initial RNA products of in vitro transcription reactions with 
ATP, NAD*, NADH, or dpCoA as initiating nucleotide and [a*?P]-CTP as 
extending nucleotide (E. coli RNAP; Pnap see analogous data for Pyaay in 


Fig. 1b). Products were treated with RppH (processes 5’-triphosphate RNA 
to 5’-monophosphate RNA and 5/-NTP to 5/-NDP/5’/-NMP*!*) or NudC 
(processes 5/-NAD*/NADH-capped RNA to 5/-monophosphate RNA®) as 
indicated. For gel source data, see Supplementary Fig. 1. 
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Extended Data Figure 2 | LC/MS/MS analysis of initial RNA products ion chromatogram (signal derived from detection of parent ion of 
of in vitro transcription reactions with NAD‘ as initiating nucleotide m/z=967 and CID fragment of m/z= 845 corresponding to NAD* pC 
and CTP as an extending nucleotide. a, Structure of NAD*pC minus nicotinamide). Reactions contained the indicated components. 
(red, atoms corresponding to CID-generated fragment ion). b, Extracted c, Mass spectrum of CID fragment. 
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Extended Data Figure 3 | Sensitivity of full-length RNA products to 
alkaline phosphatase treatment. Full-length RNA products of in vitro 
transcription reactions with [7~*P]-ATP or [a**P]-NAD* as initiating 
nucleotide and CTP, GTP, and UTP as extending nucleotides (E. coli 
RNAP; Pyar fused to an A-less cassette ). Products were treated with 
alkaline phosphatase (AP; processes 5’ phosphates) or NudC (processes 
5’-NAD*/NADH-capped RNA to 5’/-monophosphate RNA®) as indicated. 
Results indicate that full-length RNA products generated in reactions with 
[a**P]-NAD* as initiating nucleotides are not sensitive to AP until they 
are processed by NudC. M, 100-nucleotide marker. For gel source data, see 
Supplementary Fig. 1. 
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Extended Data Figure 4 | Promoter-sequence effects on efficiency of 1mM NAD‘ as initiating nucleotides and [a*”P]-CTP as extending 
NCIN-mediated transcription initiation: NAD*. a, Templates having nucleotide (E. coli RNAP3 Prnap Pgadys Pras, or Pr7a1). (We note that 
rnal, gadY, N25, and T7A1 promoters used in the assays. b, Representative | contaminating AMP in the NAD* stock results in production of pAp*C.) 
raw data from experiments of Fig. 2b. Initial RNA products of in vitro For gel source data, see Supplementary Fig. 1. 
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Extended Data Figure 5 | Promoter-sequence effects on efficiency of (E. coli RNAP3 Prnat Peadys Pos, or P71). b, Left, dependence of dpCoA- 
NCIN-mediated transcription initiation: NADH and dpCoA. a, Left, capping on [dpCoA]/[ATP] ratio (mean + s.e.m. of three determinations). 
dependence of NADH capping on [NADH ]/[ATP] ratio (mean + s.e.m. Right, relative efficiencies of dpCoA capping. (E. coli RNAP; Prnap Peadys 
of three determinations). Right, relative efficiencies of NADH capping. Pyos, or Pray). 
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transcription reactions with ATP, NAD‘, or NADH as initiating nucleotide data, see Supplementary Fig. 1. 
and [a**P]-UTP as extending nucleotide. Reactions were performed with 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


template strand template strand 


i-1 site i site i+1 site [, site 


b > 


template strand 


i-1 site i site i+1 sitef site 


NAD*tpC 
isite i+1 site ~ Esite i-1 site 


NAD*pC 


i-1 site i site 


Cc 


i-1 site isite i+1 site (\ E site 


i-1 site isite i+1 site _E site 


Extended Data Figure 7 | Structural basis of NCIN-mediated (contoured at 2.50 in a, b and 2.20 in c); red, DNA; pink, RNA product 
transcription initiation: stereoviews. a—c, Crystal structures of and diphosphate in ‘E site’ (see refs 15-17); violet spheres, Mg** (I) and 
RPo-pppApC, RPo-NAD*pC, and RPo-dpCoApC. Stereo views of Mg?* (II); grey, RNAP bridge helix. 


density and fit for initial RNA product. Green mesh, F, — F. omit map 
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Extended Data Figure 8 | AMP content of dpCoA stock. HPLC observation that the dpCoA stock contains ~2% AMP in the dpCoA stock 
chromatogram of dpCoA stock (Sigma-Aldrich, lot SLBJ2886V; 50 nmol). accounts for the formation of pApC in reactions performed with dpCoA 
Green, HPLC chromatogram of AMP (20 nmol). Comparison of (Fig. 1b). 


chromatograms indicates that the dpCoA stock contains ~2% AMP. The 
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Extended Data Table 1 | Data collection and refinement statistics 


RPo-pppApC RPo-NAD*pC RPo-dpCoApC 

Data collection 
Space group P2(1) P2(1) P2(1) 
Cell dimensions 

a, b, c (A) 186.0, 103.6, 297.4 185.0, 103.5, 296.3 185.8, 103.9, 297.1 

a, B,y (°) 90.0, 98.3, 90.0 90.0, 98.3, 90.0 90.0, 98.5, 90.0 
Resolution (A) 50.00-3.30(3.36-3.30) * 50.00-3.00(3.05-3.00) * 40.00-3.10(3.15-3.10) * 
Rsym OF Rmerge 0.185(0.954) 0.119(0.920) 0.120(0.840) 
I/oI 8.7(1.7) 10.6(1.5) 10.4(1.4) 
Completeness (%) 0.961(0.875) 0.979(0.934) 0.989(0.960) 
Redundancy 5.4(5.0) 4.0(3.7) 3.8(3.5) 
Refinement 
Resolution (A) 50.00-3.30 50.00-3.00 40.00-3.10 
No. reflections 154105 218773 187101 
Rwork’ Reree 0.211/0.258 0.201/0.254 0.210/0.247 
No. atoms 

Protein 55006 54790 55297 

Ligand/ion 1536 1664 1651 

Water 58 895 775 
B-factors 

Protein 48.0 312 49.1 

Ligand/ion 68.0 65.3 80.4 

Water 12.9 7 27.4 
R.m.s deviations 

Bond lengths (A) 0.004 0.008 0.004 

Bond angles (°) 0.79 0.95 0.69 


«Highest resolution shell is shown in parentheses. 
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Allosteric nanobodies reveal the dynamic range and 
diverse mechanisms of G-protein-coupled receptor 


activation 


Dean P. Staus!*, Ryan T. Strachan?*, Aashish Manglik**, Biswaranjan Pani!, Alem W. Kahsai!, Tae Hun Kim‘, Laura M. Wingler!, 
Seungkirl Ahn!, Arnab Chatterjee!, Ali Masoudi!, Andrew C. Kruse’, Els Pardon®’, Jan Steyaert®”’, William I. Weis®®, 
R. Scott Prosser, Brian K. Kobilka?, Tommaso Costa? & Robert J. Lefkowitz!104 


G-protein-coupled receptors (GPCRs) modulate many 
physiological processes by transducing a variety of extracellular 
cues into intracellular responses. Ligand binding to an extracellular 
orthosteric pocket propagates conformational change to the receptor 
cytosolic region to promote binding and activation of downstream 
signalling effectors such as G proteins and (-arrestins. It is well 
known that different agonists can share the same binding pocket 
but evoke unique receptor conformations leading to a wide range 
of downstream responses (‘efficacy’)'. Furthermore, increasing 
biophysical evidence, primarily using the 82-adrenergic receptor 
(82AR) as a model system, supports the existence of multiple active 
and inactive conformational states?~>. However, how agonists with 
varying efficacy modulate these receptor states to initiate cellular 
responses is not well understood. Here we report stabilization of 
two distinct 8,AR conformations using single domain camelid 
antibodies (nanobodies)—a previously described positive allosteric 
nanobody (Nb80)®’ and a newly identified negative allosteric 
nanobody (Nb60). We show that Nb60 stabilizes a previously 
unappreciated low-affinity receptor state which corresponds to 
one of two inactive receptor conformations as delineated by X-ray 
crystallography and NMR spectroscopy. We find that the agonist 
isoprenaline has a 15,000-fold higher affinity for 8,AR in the 
presence of Nb80 compared to the affinity of isoprenaline for 8.AR 
in the presence of Nb60, highlighting the full allosteric range ofa 
GPCR. Assessing the binding of 17 ligands of varying efficacy to 
the B2AR in the absence and presence of Nb60 or Nb80 reveals large 
ligand-specific effects that can only be explained using an allosteric 
model which assumes equilibrium amongst at least three receptor 
states. Agonists generally exert efficacy by stabilizing the active 
Nb80-stabilized receptor state (Rgo). In contrast, for a number of 
partial agonists, both stabilization of Rgo and destabilization of 
the inactive, Nb60-bound state (R¢o) contribute to their ability to 
modulate receptor activation. These data demonstrate that ligands 
can initiate a wide range of cellular responses by differentially 
stabilizing multiple receptor states. 

The allosteric behaviour of GPCRs is responsible for the complex 
signalling properties associated with these important regulators of 
human physiology. GPCR allostery, defined here as a linkage between 
the extracellular orthosteric ligand pocket and the intracellular 
G-protein-binding pocket, has long been analysed by pharmacological 
methods*"!° (see Supplementary Information). Conformational 
changes within a GPCR induced by agonist binding can enhance the 


affinity and binding of intracellular signalling transducers, such as 
G proteins and -arrestins. Conversely, transducer coupling further 
enhances agonist affinity, resulting in the formation of the ternary 
complex of receptor, intracellular signalling transducer, and ligand 
(Fig. 1a). The conceptual framework of the ternary complex model 
equates the magnitude of these affinity changes with the strength 
of transducer activation in cells!!!*, as demonstrated for several 
GPCR systems’*"'®. However, the structural basis underlying these 
allosteric relationships and how they relate to ligand efficacy is not 
well understood. 

Ligand-dependent GPCR activation has traditionally been 
conceptualized as a conversion between a single inactive and a 
single active receptor state. However, recent studies using various 
spectroscopic techniques have identified multiple inactive and active 
receptor states, suggesting that the mechanisms underlying receptor 
activation may be more complex than previously thought”. To better 
understand how ligands with varying efficacies may differentially 
regulate these conformations, we sought to develop reagents to 
stabilize specific inactive and active conformations of the B2AR. We 
and others have previously used Nb80, a Gs mimetic nanobody, to 
stabilize an active conformation of the BoAR®%”, Indeed, competition 
radioligand binding assays using iodinated cyanopindolol ([!*°I]CYP) 
and 3,AR reconstituted into high-density lipoprotein (HDL) particles 
(nanodiscs) demonstrated that Nb80 increases the affinity of the agonist 
isoprenaline by 75-fold (Fig. 1b), which is similar, but not identical, to 
the 33-fold increase seen in the presence of purified heterotrimeric Gs 
(Fig. 1b). To investigate the pharmacological properties of the inactive 
receptor, we identified a nanobody (Nb60) that preferentially bound 
inverse-agonist-bound 3,AR’. Remarkably, though the affinity of the 
receptor for agonist in the absence of Gs or Nb80 was presumed to 
reflect the pharmacological properties of the inactive state, the presence 
of Nb60 reduced agonist affinity by approximately 70-fold (Fig. 1b). 
The effects of Nb60 and Nb80 on radiotracer affinity were negligible 
and could not account for the large changes in affinity (Extended Data 
Table 1). 

To further quantify the allosteric effects of Nb60, Nb80, and Gs on 
agonist binding, we measured isoprenaline affinity with radioligand 
competition binding over a range of nanobody/Gs concentrations 
(Extended Data Fig. 1). As the concentration of allosteric modulator 
increases, the effect on isoprenaline affinity becomes saturable, 
reaching two opposite plateau values with Nb60 and Nb80 or Gs 
(Fig. 1c). This is a hallmark pattern of true allosteric interactions, with 
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Figure 1 | Allosteric nanobodies have opposing effects on agonist 
affinity for the 8.AR. a, Schematic of the ternary complex model. Ligand 
(L) affinity to receptor (R) increases in the presence of transducer (T), this 
allosteric linkage is denoted by dashed line with arrows. b, Compared to 
the absence of modulator, Nb60 decreases isoprenaline affinity (negative 
cooperativity) and Nb80 and Gs increases affinity (positive cooperativity) 
as assessed by radioligand competition assays using 82AR HDL particles. 
c, The effects of Nb60 and Nb80 or Gs on isoprenaline affinity are 


the net log-change in isoprenaline affinity at saturating concentration 
of nanobody/Gs gauging the extent of cooperativity between nanobody 
and agonist binding (negative for Nb60 or positive for Nb80 and Gs) 
(see Supplementary Information). As predicted by the ternary complex 
model, this coupling energy, termed a, must be constant—the effect 
of bound nanobody on agonist binding is reciprocal to the effect of 
bound agonist on nanobody binding. To verify such a prediction, we 
measured the affinity of Nb60 for the 8,AR in the absence and presence 
of agonist (isoprenaline) using isothermal titration calorimetry 
(Fig. 1d, e). As expected, the affinity of Nb60 for 8,AR decreased in the 
presence of isoprenaline. Consistent with its preference for the inactive 
state, Nb60 dose-dependently increased binding of the radiolabelled 
inverse antagonist PHJICI-118,551 to the 8.AR, whereas binding was 
decreased in the presence of Nb80 (Fig. 1f). Together, these data show 
that Nb60 and Nb80 are potent allosteric modulators that can be used 
to stabilize inactive and active 3,AR states. 

The decrease in isoprenaline affinity observed in the presence of 
Nb60 reveals a previously unappreciated ‘very-low-affinity’ state 
(Ky,) in competition binding experiments. The affinity of agonist 
for an uncoupled GPCR has traditionally been referred to as the 
‘low-affinity (K,) state; however, our results show that K; values 
reflect binding of an ensemble of conformations that exchange rapidly 
over the course of the binding reaction. This phenomenon is probably 
conserved among GPCRs, as a similar Ky, state has been observed 
with the Az, adenosine receptor using an antibody fragment’”. This 
conformational heterogeneity is consistent with recent spectroscopic 
and computational studies, which have shown that the 82AR exists in 
multiple inactive, intermediate, and active conformations that exchange 
within milliseconds”~>. To assess which receptor state Nb60 stabilizes, 
we conducted !F fluorine NMR spectroscopy of 3,AR labelled with a 
trifluoroacetanilide probe at the endogenous residue C265 located at 
the cytoplasmic end of transmembrane 6 (TM6). As shown previously, 
the unliganded 6,AR exists in two equally distributed inactive states 
(termed S1 and S2) that exchange on a fast timescale (700 + 137 1s), and 
complete conversion into the active $4 state requires both agonist and 
transducer binding? (Fig. 2a, b). Using structural insights from double 
electron-electron resonance studies, $1 was identified as an inactive 
state with an interaction between TM3 residue R131** (superscripts 
indicate Ballesteros-Weinstein numbering for GPCRs!*) and E268°*° 


saturable functions of their concentration. d, e, The affinity of Nb60 for 
unliganded BAR (d), represented by a tight isotherm sigmoidal binding 
curve’, is reduced in the presence of isoprenaline (iso) (e), as determined 
by isothermal titration calorimetry. f, Nb60 dose-dependently increases 
and Nb80 decreases the binding of the radiolabelled antagonist [*H]ICI- 
118,551 to the BAR. All radioligand binding studies represent a minimum 
of three independent experiments with deviation shown as the standard 
error. 


in TM6 (ref. 5), commonly termed the ionic lock. The ionic lock has 
previously been shown to be important in maintaining the inactive con- 
formation of BAR, as charge-neutralizing mutations at these positions 
increase receptor constitutive activity’. Additionally, the $2 conforma- 
tion was also identified as an inactive state but with a disengaged ionic 
lock (Fig. 2b). The binding of G protein or other positive allosteric 
modulators such as Nb80 lowers the energy of the active receptor states, 
driving the receptor from $2 towards active conformations®. The !°F 
NMR spectra showed that the addition of Nb60 to 8.AR bound to 
the inverse agonist carazolol shifted the $1-S2 equilibrium towards 
the inactive S2 state (Fig. 2c), providing a mechanism for its negative 
cooperative effects on isoprenaline affinity. Given the broad NMR line 
shape (Fig. 3c, red line) of the 82AR when bound to carazolol and Nb60, 
we conducted Carr-Purcell-Meiboom-Gill (CPMG)”! relaxation 
dispersion measurements to measure potential conformational 
heterogeneity. We found that the 8.AR when bound to Nb60 and 
carazolol interconverts (860 -+ 530s ') between S1 and $2, but is predom- 
inantly found (75-90%) in the S2 inactive state (Extended Data Fig. 1d). 
To further decipher how Nb60 induces a negative cooperative effect 
on agonist binding, we determined a 3.2 A X-ray crystal structure of 
a ternary complex comprised of 82AR, Nb60, and the inverse agonist 
carazolol (Fig. 2d-f, Extended Data Fig. 2 and Extended Data Table 2). 
The complementary determining region 3 (CDR3) of Nb60 inserts 
into a similar 82AR allosteric pocket as G protein and Nb80, located 
between the cytoplasmic ends of TM3, TM4, and TM6 (Fig. 2d, e). 
We found that T102 and Y106 in Nb60 bridge an interaction between 
residues R1313%° and E268°*”. This interaction does not exist in the 
absence of Nb60 (Protein Data Bank (PDB) accession code 2RH1), 
indicating that Nb60 stabilizes an inactive conformation through inter- 
actions with the 82AR ionic lock (Fig. 2e). However, this polar network 
appears ‘disengaged’ compared to the fully closed ionic lock in the 
B,-adrenergic receptor (PDB accession code 2YCW)” (Fig. 2e), thus 
supporting the '"F NMR data showing that Nb60 specifically stabilizes 
the S2 inactive state. The insertion of Nb60 F103 into a hydrophobic 
pocket in the 82AR may also contribute to the affinity and/or negative 
allosteric properties of Nb60 (Extended Data Fig. 2c). Confirming the 
importance of T102 and F103 for the 8. AR-Nb60 interaction, alanine 
mutations at these positions inhibited Nb60 binding to the B2AR and 
the negative cooperative effects on isoprenaline binding (Extended 
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Figure 2 | Nb60 stabilizes the S2 inactive state by coordinating the 82.AR 
ionic lock. a, b, Cartoon depicting a side (a) or cytoplasmic (b) view of 
the 8,AR transmembranes (TM). Conversion from the two inactive states 
(S1 and S2) to the active S4 state requires both agonist and transducer 

(G protein) binding and is represented by a 14 A outward movement of 
TM6. c, '°F NMR spectroscopy of the 3,AR with the antagonist carazolol 
(Cz) + Nb60. d, The 3.2 A structure of the BAR bound to carazolol (Cz) 


Data Fig. 2e, f). Other than changes within the ionic lock, the overall 
structure of 82AR bound to Nb60 is highly similar to the previously 
determined inactive 3,AR structure bound to carazolol alone (root 
mean squared deviation (r.m.s.d.) of 0.3 A for the transmembrane 
domains and orthosteric binding pocket, Fig. 2f). Taken together, the 
pharmacological, biophysical, and crystallographic studies show that 
Nb60 exerts its negative allosteric effect on agonist binding by stabiliz- 
ing the S2 inactive 8,AR conformation. 

Our observation that isoprenaline bound to the Nb80-stabilized 
active B2AR with approximately 15,000-fold greater affinity than to 
the Nb60-stabilized inactive 8,AR (Fig. 1b, c) provides a measurement 
of the full allosteric power of an agonist to activate a GPCR. The large 
free energy difference (—24kJ mol) between these states is probably 
important for GPCR function, allowing agonist-stimulated activity to 
be significantly higher than that of the basal activity. As simulated for 
a full agonist in Fig. 3a, the overall affinity shift from inactive to active 
receptor (black arrow) is comprised of two components, the negative 
cooperative (cx) effects of Nb60 (aNb60, blue) and positive effects of 
Nb80 (aNb80, red). Given the complexities and limitations of using 
NMR and crystallography to gain mechanistic insights into ligand 
activation of a GPCR, we used a pharmacological approach to quantify 
oNbé60 and aNb80 for 17 82AR ligands of varying efficacy (Fig. 3b and 
Extended Data Fig. 3). 

We first identified a significant positive correlation (r= 0.8514, 
P=0.004) between aNb80 values (K,/Ky ratios) and the relative 
intrinsic efficacies (tT values) of various ligands obtained from cellular 
G-protein assays (Extended Data Fig. 4a, b). This finding provides 
additional support for the claim that Nb80 exerts allosteric effects that 
mimic those ofa G protein. It also confirms our previous reports that 
ligand efficacy is not a product of modified downstream signalling 
events but is rather achieved at the level of ternary complex interactions, 
reflecting the allosteric interactions between different ligands and 
transducers". Interestingly, we find no significant correlation between 
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Figure 3 | Nb60 and Nb80 have varying effects on the affinity 

of different 32AR ligands. a, Schematic depicting the use of equilibrium 
radioligand binding studies to quantify the cooperativity 

(cx) between Nb60 or Nb80 binding and ligand affinity (see Methods 
and Supplementary Information). c.p.m., counts per minute. 

b, Cooperativity values for Nb60 (aNb60) and Nb80 (aNb80) for 6,AR 
ligands with varying efficacies. Ligands are ordered by magnitude 

of aNb80. c, Correlation plot of ~Nb60 and aNb80; regression 

shown as solid red line with 95% confidence interval (dotted red line). 
All a values derived from at least three independent radioligand 
binding experiments with the deviation depicted as standard error. 
Adr, adrenaline; alp, alprenolol; carv, carvedilol; caraz, carazolol; 

clen, clenbuterol; fen, fenoterol; form, formoterol; hbi, hydroxybenzyl 
isoproterenol; ICI, ICI-118,551; iso, isoprenaline; isoe, isoetharine; 

pin, pindolol; proc, procaterol; salb, salbutamol; salm, salmeterol; zint, 
zinterol. 
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aNb60 and ligand efficacy, providing the first evidence, to our 
knowledge, that ligands perceive the Nb60 and Nb80 stabilized recep- 
tor states differently (Extended Data Fig. 4c). We observed a signifi- 
cant negative correlation (r= 0.7417, P=0.0013) between aNb60 and 
aNb80 values (Fig. 3c) across all ligands, suggesting these nanobodies 
stabilize functionally opposite conformations. However, the relation- 
ship between aNb60 and aNb80 was unexpectedly complex (Fig. 3b). 
For example, several full agonists exhibited comparable levels of positive 
(Nb80) and negative (Nb60) cooperativity (noradrenaline, adrenaline, 
and isoprenaline), whereas some partial agonists displayed patterns 
with surprising discrepancies (clenbuterol, salbutamol, and zinterol) 
(Fig. 3b). Importantly, for a subset of these ligands we confirmed that 
the allosteric effects of Nb80 are consistent with those elicited by the 
physiological transducer heterotrimeric Gs (Extended Data Fig. 5a, b), 
Moreover, the surprisingly divergent, ligand-specific effects of Nb60 
were also observed with another inactive state-stabilizing nanobody, 
A11 (Extended Data Fig. 5c). Together, these data indicate that the 
ligand-specific effects of Nb80 and Nb60 are not nanobody-specific, 
but rather reflect how ligands perceive specific receptor conformations. 

We next tested two different allosteric models to try to explain how 
the different conformations stabilized by Nb60 and Nb80 can generate 
the observed cooperativities. First we tested whether the dynamics of 
receptor states can be sufficiently modelled as a simple interconversion 
between two allosteric conformations, despite biophysical evidence for 
multiple inactive and active receptor states. Accordingly, we attempted 
to fit the experimentally observed aNb60 and aNb80 values for all 
ligands using the two-state model of receptor activation (Fig. 4a). In 
this simulation, the equilibrium constant J represents the distribution 
of inactive (Reo) and active (Rgo) receptor states in the absence of 
ligand. The variable 3 describes the effect that each ligand has on 
the distribution of receptor states (J), with agonists displaying larger 
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Figure 4 | 82AR agonists differentially stabilize receptor states to 
regulate receptor activation. a, b, Illustration of a two-state (a), or 
three-state (b), model of receptor activation describing the effect of B,AR 
ligands on receptor conformations stabilized by Nb60 (Rgo) or Nb80 (Rgo). 
The equilibrium (J) between receptor states can be influenced by ligand 
binding through the allosteric factor (3. The theoretical cooperativity (a) 
between nanobody and ligand binding derived from the two-state model 
(dashed black line) fails to predict the observed a values for a subset of 
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@ values (that is, they stabilize more Rgo relative to Reo). As shown in 
Fig. 4a, the theoretical curve generated by varying /3 in the two-state 
model (dotted black line) failed to accurately predict the experimen- 
tally determined aNb60 and aNb80 values for 30% of ligands tested, 
consisting primarily of partial agonists (dotted red oval). A different 
equilibrium J constant would be required to explain the cooperativity of 
these ligands with the same two-state model. These findings argue that 
ligands must modulate more than these two states to control receptor 
activation. 

Biophysical evidence supports the existence of at least three predomi- 
nant receptor states; therefore we hypothesized that a three-state model 
(Fig. 4b) may better explain the experimentally observed aNb60 and 
oNb80 values. In this model, the equilibria of Rgo (J1) and Reo (J2) can 
be regulated separately by ligands, described by the allosteric factors 
(1 and (3, respectively. We found that the observed cooperativities 
for 12 ligands (Fig. 4b, class I, orange), which encompasses all the full 
agonists, can be predicted if these ligands primarily stabilize the active 
Rgo state, while having negligible effects on the inactive R¢o state. The 
efficacy of these ligands is thus directly proportionally related to their 
effect on the Rgo equilibrium (;), such that highly efficacious ligands 
have a large 3; (adrenaline), whereas partial agonists have a lower 3; 
(clenbuterol). In contrast, the observed cooperativity of other partial 
agonists (classes II and III) could only be predicted by differentially 
modulating the Rgo and Reo equilibria, suggesting these ligands regulate 
multiple receptor states to control receptor activation. For example, 
even though clenbuterol (class I), zinterol (class II), and procaterol 
(class III) are similar partial agonists (Extended Data Fig. 4a), the 
mechanism by which they promote receptor activation varies based 
on their ability to differentially stabilize and destabilize the active (Rgo) 
and inactive (R¢o) states, respectively (Fig. 4b). Importantly, divergences 
in receptor activation mechanisms can only be uncovered by studying 
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ligands (dashed red oval). However, the observed cooperativity values can 
be accurately predicted using an allosteric model in which ligands can 
differentially modulate three independent receptor states (three-state). 
Certain ligands (orange) primarily stabilize the active Rgo state, whereas 
others (purple or green) can stabilize Rgo but simultaneously destabilize 
the inactive R¢o state. All « values are derived from at least three 
independent radioligand binding experiments with the deviation depicted 
as standard error. 


21 JULY 2016 | VOL 535 | NATURE | 451 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


ligand effects at both the Nb60-stabilized and Nb80-stabilized 82AR 
states. The inability to accurately predict efficacy (G-protein activation) 
based on ligand class supports the notion that molecular efficacy is a 
complex process that probably involves many more unresolved receptor 
states. 

These findings reshape our current understanding of GPCR activa- 
tion. For almost 40 years, studies of these receptors have been guided by 
the understanding that they can exist in two forms, one of high affinity 
for agonists (active, coupled to the G protein) and one of low affinity 
(inactive, not coupled to the G protein), which are interconverted 
by the cooperative interaction of agonists and guanine nucleotides. 
We used conformationally sensitive nanobodies (Nb60 and Nb80) 
to reveal that Nb60-bound 82AR has affinity ~100-fold lower than 
the previously described low-affinity inactive state, which is in fact an 
‘average’ of multiple, rapidly interconverting inactive and active states. 
The implication is that the full allosteric potential of the 8,AR is orders 
of magnitude greater than formerly appreciated. Moreover, binding 
cooperativity between ligands of varying efficacy and Nb60 and Nb80 
could only be explained by models featuring multiple (at least three) 
receptor states, which are differentially stabilized and/or destabilized by 
various agonists. Thus these data demonstrate a previously unsuspected 
heterogeneity in the mechanisms by which different agonists stimulate 
biological responses. In view of the general conservation of GPCR 
activation mechanisms, and a report of an Az, adenosine receptor 
‘Nb60-like’ state, these findings will probably apply to many GPCRs. 
Given the primacy of GPCRs as therapeutic targets, these new findings 
and concepts may also provide a basis for the design of drugs with 
potentially new therapeutic properties. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data reporting. No statistical methods were used to predetermine sample size. 
The experiments were not randomized. The investigators were not blinded to 
allocation during experiments and outcome assessment. 

B2AR reconstitution in HDL particles. Human 8,AR containing an amino- 
terminal Flag epitope tag, carboxyl-terminal HIS-tag, and a N187E glycosylation 
mutation was expressed in Sf9 insect cells using the BestBac baculovirus system 
(Expression systems; Davis, CA). Cells were infected at a density of 3 x 10° 
cells per ml and harvested 66h afterwards. Functional Flag-3AR was purified 
using Flag-M1 antibody and alprenolol affinity chromatography as previously 
described‘. Flag-3AR was reconstituted into HDL particles (nanodiscs) 
according to previously published methods”. In brief, Flag—3.AR was incubated 
with a 50-fold molar excess of membrane scaffold protein 1 (MSP1) and 8mM 
POPC:POPG (3:2 molar ratio) for 1h at 4°C. Detergent was removed via biobeads 
overnight at 4°C, and receptor-containing nanodiscs were isolated using Flag~M1 
chromatography and size-exclusion chromatography. 

Nanobody and Gs purification. Nanobodies were generated and purified as 
previously described’. In brief, Nb60 and Nb80 clones were originally obtained 
from one llama (Lama glama) immunized with 82AR (truncated at residue 
365) reconstituted into liposomes bound to the high affinity agonist BI-167107. 
Nanobody coding sequences were amplified from isolated lymphocyte RNA and 
cloned into phage display vector. Conformationally selective nanobodies were 
isolated using a series of biopanning procedures targeted against native receptor 
bound to agonist. Nanobodies were grown in E. coli WK6 cells and purified 
from the periplasmic extract using nickel-agarose and resource S ion exchange 
chromatography. Heterotrimeric Gs protein was expressed in and purified from 
Sf9 insect cells using nickel-agarose and ion exchange chromatography as described 
previously”®. 

Radioligand binding assays. Competition binding assays (250 11) contained 
60 pM [!*°I]cyanopindolol, a serial dilution of competitor, the indicated 
concentration of nanobody/Gs, and approximately 0.5 ng of 82AR nanodiscs 
diluted in assay buffer (50mM Tris-HCl pH 7.4, 12.5mM MgCh, 2mM EDTA, 
0.05% BSA, 1mM t-ascorbic acid). Total binding was determined in the absence 
of competitor; nonspecific binding was determined using 10\.M propranolol. 
Following a 90 min incubation at room temperature, binding assays were 
terminated by rapid filtration onto GF/B glassfibre filters treated with 0.3% PEI and 
washed with 8 ml of cold binding buffer using a harvester (Brandel, Gaitherburg, 
MD). Bound [!”°I] was quantified using a Packard Cobra Quantum gamma counter 
(Packard, San Diego, CA) and expressed as specific binding. For PHJICI-118,551 
binding, 10 ng of 8,AR nanodiscs were incubated with 0.3nM radioligand and 
varying concentrations of nanobody and were harvested as described above. [I'*°] 
cyanopindolol affinity was determined using saturation binding (Extended Data 
Table 1). All binding data represent a minimum of three independent experiments 
with deviation represented as standard error. 

Isothermal titration calorimetry (ITC). ITC experiments were carried out 
using the MicroCal iTC200 system. 3,AR and Nb60, purified as described above, 
were both dialysed against a buffer consisting of 20 mM HEPES pH 7.5, 100mM 
NaCl, 0.01% lauryl maltose neopentyl glycol (MNG), and 0.001% cholesteryl 
hemisuccinate (CHS). Protein concentrations were determined by measurement 
of the absorbance at 280 nm, using molar extinction coefficient per cm parameters 
of each protein (€ for 8,AR: 66,350cm | and Nb60: 21,430 cm !). Nb60 at 100\1M 
concentration (4011) was loaded into the syringe and titrated into the 200 1l sample 
cell containing 3,AR (101M) in absence or presence of 200|1M isoprenaline. 
Titrations were performed at 25°C using an initial injection of 0.511, followed 
by 2.3 11 injections (1s duration, 300s spacing, and 5s filter period). Reference 
power was set to 10j1cals-! and stirring speed to 750 r.p.m. Reference titrations 
were obtained by injecting Nb60 alone or Nb60 with agonist (isoprenaline) 
into sample cells containing buffer alone. To obtain the stoichiometry (N) and 
association constant (Ka) of the interaction, the raw data were baseline corrected, 
peak area integrated, and fitted to a one-site nonlinear least squares fit model using 
the Origin7 software program. Binding experiments were performed at least twice 
to calculate standard deviations. 

Enzyme-linked immunosorbent assay (ELISA) for determining Nb Ky. The 
affinity of each nanobody for the unliganded 8,AR (Kr) was required for ternary 
complex model data analysis. Briefly, 50 ng of carboxypeptidase-treated (to remove 
Hiss tag), biotinylated 8,AR nanodiscs were immobilized per well of a 96-well 
SigmaScreen Streptavidin plate in binding buffer (20 mM HEPES pH 7.4, 100 mM 
NaCl, 2mM EDTA). Following a 3h incubation at room temperature, plates were 
washed four times with binding buffer and incubated with serial dilutions of 
nanobody in binding buffer with 0.1% milk for 2h at room temperature. Plates 
were quickly washed three times with binding buffer and then incubated with 
polyclonal anti- His HRP-conjugated antibody (1:5,000, Abcam) in binding buffer 
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with 0.1% milk for 1h at room temperature. Plates were quickly washed three times 
with binding buffer and bound nanobody was detected colorimetrically (ODasonm) 
on a NovoStar plate reader (BMG Labtech) using TMB Ultra 1-step substrate as 
described by the manufacturer (Pierce). 

Crystallization of (,AR-carazolol-Nb60 complex. For crystallography trials, 
we used a previously described fusion protein of human 82AR with an amino- 
terminal T4 lysozyme (T4L) fusion and a truncated intracellular loop three””. The 
T4L-6,AR fusion protein was expressed and purified as described above. Purified 
receptor was incubated with fivefold and twofold molar excess of carazolol and 
Nb60, respectively. The 82AR-carazolol-Nb60 complex was incubated overnight 
at 4°C and further purified by size exclusion chromatography over a Sephadex 
$200 column in a buffer comprised of 20mM HEPES pH 7.5, 100 mM NaCl, 0.01% 
MNG, and 0.001% CHS. The purified complex was concentrated to 37.5mg ml! 
and flash frozen in liquid nitrogen for crystallization trials. We used the in meso 
crystallization method with a 10:1 (w/w) mix of 7.7 monoacylglycerol (MAG 7.7) 
and cholesterol as the host lipid (Avanti Polar Lipids). Samples were thawed 
and reconstituted with the host lipid at a 1:1 ratio with the two-syringe mixing 
method until a clear phase was achieved. The resulting lipidic cubic phase was 
dispensed in 35 nl drops onto glass sandwich plates using a GryphonLCP robot 
(Art Robbins Instruments). Crystals grew in a precipitant solution comprised of 
100mM HEPES pH 7.5, 20mM EDTA, and 19-23% PEG300. Although crystals 
appeared in 2-3 days, the best diffraction was obtained from crystals that grew 
over 30 days. Crystals were harvested with mesh loops (Mitegen) and flash frozen 
in liquid nitrogen. 

Data collection and refinement. X-ray diffraction was collected at the Advanced 
Photon Source GM/CA beamline 23ID-B. As with previous in meso crystallography 
of GPCRs, crystals suffered considerable radiation damage during data collection. 
Typically, 20-60 degree wedges of data were collected using a 20j1m beam. The 
resulting diffraction data from 10 crystals were processed in HKL2000 (ref. 28). 
The structure of the 8,AR-carazolol-Nb60 complex was solved by molecular 
replacement using carazolol-bound }2AR with the T4 lysozyme removed (PDB 
accession code 2RH1) anda structure of Nb60 solved to 1.8 A as search models in 
Phaser?’. The model was refined iteratively in Phenix*? with manual rebuilding 
in Coot*! with and without averaging of non-crystallographic symmetry. The 
quality of the final model was assessed using MolProbity*” and refinement 
statistics are presented in Extended Data Table 2. Figures were prepared using 
PyMOL (Schrodinger). The r.m.s.d. analysis for the orthosteric binding pocket 
was performed in PyMOL by measuring residues within 4 A of the ligand carazolol 
between 3,AR-Nb60-Cz and the Protein Data Bank accession code 2RH1. 

19F NMR of 82AR. !F fluorine NMR studies were conducted as previously 
described’. Briefly, full-length 8,AR with four cysteine mutations (C77V, C327S, 
C378A, and C406A) was expressed in Sf9 cells, extracted using dodecylmaltoside, 
and purified initially by M1 Flag affinity chromatography. The receptor sample 
was subsequently labelled with a bromotrifluoroacetanilide probe at a fivefold 
stoichiometric excess and purified by alprenolol-sepharose and M1 affinity 
chromatography. As done for previous !7F NMR experiments, the detergent was 
gradually exchanged to 0.01% lauryl maltose neopentyl glycol (MNG, Anatrace). 
The final receptor sample was concentrated to 200M and a 2.5-fold molar excess 
of carazolol was added before acquisition of NMR data. After acquisition of signal 
for the 8,AR-carazolol complex, Nb60 was added to a 2.5-fold molar excess, and 
19F NMR spectra were obtained for the 8,AR-carazolol-Nb60 complex. NMR 
studies were performed at 25°C on a 600 MHz Varian Innova spectrometer 
equipped with a cryogenic probe capable of !7F NMR (600 MHz). All spectral 
processing was performed using Mnova 9.0.0 (Mestrelab Research). Additional 
information on data processing and line shape simulations are described in detail 
elsewhere’. !°F CPMG relaxation data were recorded by applying CPMG frequen- 
cies ranging from 500 to 5,000 Hz at 600 MHz with a constant time of 2 ms. CPMG 
profiles were fitted to a two state model with Chemex as described elsewhere*’. 
The °F 17/2 pulse width was 191s. 

Fitting of binding curves with the ternary complex model. The allosteric effect 
of nanobodies on the binding of receptor ligands is quantified by the cooperatvity 
factor .. Its magnitude is given by a ratio of dissociation constants (that is, K,/Ky 
for Nb80 and K;/Ky, for Nb60) determined from the comparison of how each 
ligand binds to 3AR in the absence of versus the saturating presence of each 
nanobody (Extended Data Fig. 3). However, under conditions in which the 
nanobody concentration is sub-saturating or the radiotracer « is not exactly 1, 
experimental Kg ratios may underestimate the size of the cooperativity factor. To 
improve measurements of Nb/ligand « values, we fitted the binding curves with 
a mathematical function derived from TCM. This entails solving the following 
nonlinear system of equations that yield the concentration of free species for a 
binding reaction consisting of one receptor (R), one Nb (N) and n ligands (Lj) at 
equilibrium: 
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where subscripts fand t indicate respectively free and total concentrations and the 
equilibrium binding constants of ligands (K) and nanobody (M) are defined as in 
supplementary (analysis of binding cooperativity). Exact numerical solutions were 
obtained using a globally convergent iterative algorithm™, which was coded as 
add-in function for Excel*®. The built-in optimizer ‘solver’ was used to minimize 
the sum of squares difference between predicted and experimental data (choosing 
the Newton search option and setting convergence threshold at 10~!*). Sets of 
3 curves representing bound radiotracer as a function of increasing concentrations 
of unlabelled ligand, obtained in the absence and presence of Nb80 or Nb60, were 
fitted simultaneously. Known parameters are the M of each Nb (Extended Data 
Fig. 6), the K and a value of [!”°I]cyanopindolol (Extended Data Fig. 1). Unknown 
fitted parameters are: non-specific tracer binding, receptor concentration and 
K value of the competing ligand (both shared across the 3 curves), and the 
individual aNb80, «Nb60 values for each ligand/Nb pair. For data measured in 
the presence of Gs (the M value of which was not available), all ligands curves 
were fitted globally and constrained to share a common best-fitting M. This fit- 
ted value was close to that experimentally measured for Nb80. See supplemental 
information and Extended Data Figures 7 and 8 for additional information on 
allosteric modeling. 
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Extended Data Figure 1 | Characterization of Nb60 interaction equilibrium binding studies using ['?°I]cyanopindolol, the non-labelled 
with 8,AR. a—c, Competition equilibrium binding studies using [!?°I] competitor agonist isoprenaline, 8,AR in HDL particles, and 11M 
cyanopindolol (CYP), the cold competitor agonist isoprenaline (ISO), wild-type Nb60 or Nb60(T102A/F103A). f, ELISA depicting capture of 
B2AR in HDL particles, and the indicated concentration of Nb80 (a), BAR by wild-type Nb60 or the T102A/F103A variant. Inset: Coomassie 
Gs (b), or Nb60 (c). The dotted vertical line represents log IC59 in absence stain of nanobody input. Radioligand binding and ELISA experiments 
of modulator, and the change in ligand affinity is depicted with coloured were performed at least three times with deviation shown as standard 
arrows. d, !?F NMR CPMG relaxation dispersion experiment with error. 
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Extended Data Figure 2 | Characterization of 8,.AR-Nb60-carazolol CDR3 (purple) into hydrophobic 8,AR pocket, nitrogen and oxygen 
crystals. a, Monodispersity of T4L-32AR-Nb60-carazolol (8,AR- shown as blue and red shaded surfaces, respectively. d, Example of 
Nb60-Cz) complex as assessed by size exclusion chromatography. Inset, B2AR-Nb60-Cz crystal lattice. e, Electron density 2F,-F, map (Sigma: 1) 
Coomassie stain illustrating presence of 3,AR and Nb60 in fractions of carazolol binding pocket (top panels) Nb60 CDR3 binding pocket 


combined for crystallography. b, Representative picture of 8: AR-Nb60-Cz — (bottom panels) within BAR. 
lipidic cubic phase (LCP) crystals. c, Insertion of F103 (green) from Nb60 
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Extended Data Figure 3 | Differential effects of Nb60 and Nb80 on the affinity of 12 different 82AR ligands. Competition equilibrium binding 
studies using ['?°I]cyanopindolol, the indicated non-labelled competitor, 82AR in HDL particles, and 1 1M of Nb60 or Nb80. Data represent at least three 


independent experiments with deviation depicted as standard error. 
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Extended Data Figure 4 | Agonist-induced G-protein activation 

in cellulo correlates with the magnitude of affinity change mediated 

by Nb80 in vitro. a, Table representing cell signalling and ligand affinity 
data. Ligand-dependent G-protein activation was quantified by measuring 
cAMP levels (GloSensor, Promega) from HEK293 cells overexpressing 
B2AR. Ligand affinity was measured in membranes prepared from 


Isoe 


Noradr 


Log tau Gs 


Log aNb60 


the same cells as above using competition binding assays with [!?°1] 
cyanopindolol. Ligand efficacy (log T) was calculated as previously 
described*®. See methods and Supplementary Information for 
cooperativity («) determination. b, c, Correlation plot of log t and 
aNb80 (b), or aNb60 (c). All data represent at least three independent 
experiments with deviation shown as standard error. 
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cooperativity values () for Nb80 and Gs. c, Sequence alignment of Nb60 
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Extended Data Figure 6 | Affinity determination for Nb60 and Nb80 
for unliganded 8,AR. ELISA assay detecting capture of increasing 
concentrations of Nb60 or Nb80 by immobilized HDL 8,AR in the absence 
of ligand. All data represent at least three independent experiments with 
deviation shown as standard error. 
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Extended Data Figure 7 | Theoretical framework illustrating the two 
views of allostery. a, Nested reaction schemes at equilibrium indicating 
the correspondence (arrowed light-blue shades) between binding site 
cooperativity (ternary complex model in outer box) and changes of 
allosteric conformations (inner cubes). Arrows stand for reversible 
equilibrium interactions. b, Change of the macroscopic dissociation 
constant (1/K) of a ligand L (shifting the equilibrium towards r,) induced 
by increasing the concentrations of nine different N-ligands with diverse 
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allosteric effects (+, \2) on receptor states. Simulations were made using 
a three-state model based on the parameter values listed on the right 
side of the plot (curves on the left side are colour coded in red/blue tones 
corresponding to the boxes on the right). The change in K (that is, log 
difference between presence and absence of N) is calculated from 
equation 1 in the Supplementary Information (analysis of nanobody 
allostery). 
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Extended Data Figure 8 | Comparison of experimental and theoretical 
cooperativities predicted according to a two-state or three-state 
allosteric model. See also the Supplementary Information section 

on analysis of nanobody allostery. ad, Theoretical log « values were 
computed according to a two-state model for a series of hypothetical 
ligands (L) (log () range: 4/8) and a positive (PAN, log 7; >> 0) or 
negative (NAN, log 7; << 0) nanobody. a—-d, Observed data overlaid 

on values simulated at J; = 8.9 x 10+ in histogram form (with 
experimental bars drawn on the closest theoretical log 3, bin value) (a), or 
superimposed (b), on the log anan versus log apan relationships predicted 
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for different J, values. The same data are replotted as separate graphs for 
lower J; (c) and larger J; (d) values, to show the sigmoidal relationships 
existing between macroscopic log as and log (3). e, f, Simulations made 
according to the three-state allosteric model. e, Predicted (lines) and 
observed (circles) log « values plotted as functions of log ((3)/2). Three 
groups of ligands (I to II, defined by the table of a0 and m parameters) 
produce increasingly stronger reductions of rz equilibrium. f, Same data 
plotted as log aNb60 versus log «Nb80 relationships (see Fig. 4). All 

a values derived from at least three independent radioligand binding 
experiments with deviation depicted as standard error. 
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Extended Data Table 1 | Effect of Nb60 and Nb80 on [!25I]cyanopindolol affinity 


Control 
Nb80 
Nb60 


['51]-Cyanopindolol Affinity Kd Ratio 

Kd (pM) SEM Control / Nb SEM 
[313 eRe = —s- 
315.6 62.4 20 0:35 
663.0 Lihat 1.1 0.12 
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Extended Data Table 2 | Data collection and refinement statistics (molecular replacement) 


B.A R-Carazolol-Nb60° 


Data collection 


Space group P2;2;2, 
Cell dimensions 

a, b,c (A) 43.9, 164.5, 218.8 

a, B, y (°) 90.0, 90.0, 90.0 
Resolution (A) 32.9-3.2 (3.30-3.20) 
Rmerge (%) 17.5 (89.6) 
<I/ol> 5.3 (1.5) 
CCip (%) 98.6 (82.4) 
Completeness (%) 98.6 (95.5) 
Redundancy 5.5 (4.6) 
Refinement 
Resolution (A) 32.9-3.2 (3.30-3.20) 
Number of reflections 26,778 
Rwork/R tree (%) 24.7/29.0 (38.3/43.9) 
Number of atoms 

Protein 7,703 

Ligand (Carazolol) 44 

Solvent (lipid, water, other) 56 
B-factors (A’) 

Protein 129.81 

Ligand (Carazolol) 116.36 

Solvent (lipid, water, other) 157.51 
R.M:S. deviation from ideality 

Bond lengths (A) 0.003 

Bond angles (°) 0.429 
Ramachandran statistics® (%) 

Favored 96.8 

Allowed 32 

Outliers 0 


Diffraction data from 10 crystals were merged into a complete data set. Highest resolution shell statistics are shown in parentheses. ‘As calculated by MolProbity. 
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SEBASTIAN KAHNERT/DPA/PICTURE ALLIANCE 


LIVE FAST, DIE YOUNG 


Research into ageing requires patience, but a small cadre of scientists is angling to speed up 
answers by developing the flamboyant, short-lived turquoise killifish as anew model. 


The turquoise killifish (Nothobranchius furzeri), with its fleeting life, is helping researchers to study the mechanisms that influence human ageing. 


BY AMBER DANCE 


Pater: Alessandro Cellerino has 


always been an aquarium enthusiast, 

but fish were not originally part of his 
research plan. One afternoon in 2000, hanging 
out ina tank-filled cellar in Canossa, Italy, with 
breeder Stefano Valdesalici, Cellerino idly asked 
him which fish were the shortest-lived. Valde- 
salici pointed to a tank with brightly speckled 
African turquoise killifish: “They don’t make it 
any longer than three months.” 

“Are you kidding?” asked Cellerino, who 
works at the Scuola Normale Superiore in Pisa, 
Italy. “OK, I want them” 

So in March 2004, Cellerino and his gradu- 
ate student Dario Riccardo Valenzano found 
themselves bouncing through Mozambique 
in a four-wheel-drive truck with Valdesalici, 
who chairs the Italian Killifish Association in 
Canossa. They donned chest-high waders and 
gloves to net killifish from the seasonal, cow- 
pat-spattered mud holes where the fish live. 
Like the tank-bred versions, these wild strains 


were exceptionally short-lived. Of the many 
varieties of killifish, the turquoise killifish 
(Nothobranchius furzeri) has the shortest lifes- 
pan — the briefest of any vertebrate bred in 
captivity, ranging from 3 to 12 months depend- 
ing on strain and living conditions. 

Using killifish to study ageing is not a new 
idea. In the late twentieth century, scientists 
studied ageing in one species, Nothobranchius 
guentheri, that lives for about 14 months. But 
given techniques available at the time, they 
could come up with only basic descriptions of 
ageing features. When Cellerino encountered 
N. furzeri, timing and luck were on his side: 
advances in molecular analysis had set up excel- 
lent conditions in which to develop the model 
and investigate mechanisms behind its dotage. 

The killifish’s brief lifespan, relative to those of 
longer-lived models such as mice and zebrafish, 
enables ageing research to progress apace. And 
because the fish is a vertebrate, the research is 
more directly relevant to people than are stud- 
ies of short-lived organisms such as fruit flies 
or nematodes (see “The long and short of it’). 


To set up a killifish model, researchers have 
taken advantage of modern genomic tools 
and drawn techniques from well-established 
zebrafish protocols, rather than starting from 
scratch. In 2015, the publication of CRISPR- 
Cas9-based gene-editing techniques for 
the killifish', as well as two complementary 
genome-sequencing efforts”, boosted the fish 
to the status of genetically tractable model. 

The killifish — or Notho, as some scientists 
affectionately call it — is certainly gaining fans. 
Interest has “really exploded over the past few 
years’, says Valenzano, who now works at the 
Max Planck Institute for Biology of Ageing in 
Cologne, Germany. He estimates that about two 
dozen scientists have visited his group in the past 
year to learn killifish husbandry. In June, about 
70 Notho aficionados attended the second 
Nothobranchius Symposium in Jena, Germany. 
But the challenges of keeping killifish — such as 
their lack of a standardized diet — and a want 
for basic reagents, such as Notho-specific anti- 
bodies, mean that the fish has a way to go before 
it reaches the utility of lab mice. 
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TECHNOLOGY Panty 


>» The ephemeral existence that so appeals 
to scientists is an evolutionary adaptation to 
the fish’s natural environment: their acceler- 
ated development enables them to live and 
reproduce in transient mud pools during the 
wet season in equatorial Africa. The eggs sur- 
vive ina dormant state during the dry season, 
and once the rains come and pools form, they 
hatch. The fish have only a few weeks or months 
to grow up and spawn before the water dries up. 

Hobbyists, attracted by the males’ flashy 
appearance, have been collecting turquoise killi- 
fish since they were discovered in Zimbabwe in 
1968. Consequently, Cellerino’s first challenge 
was to confirm that their lifespan was nota side 
effect of decades of breeding in tanks. Most of 
the wild N. furzeri that Cellerino’s team caught 
in Mozambique lived for about eight months — 
notas brief a time as the inbred Zimbabwe line, 
but still short enough to interest scientists. 

But that begged another question: would 
killifish age in a way that parallels the human 
process? Yes, says Valenzano: the fish do get 
‘old’ before they die. “They don’t drop dead 
after four months,’ he says. “They slowly dete- 
riorate.” The fish become duller in colour, lose 
muscle mass and body weight, develop cancers 
and swim around less. 

The brain shows typical signs of ageing, too, 
says Livia D'Angelo, an anatomist at the Univer- 
sity of Naples Federico II in Italy. Glia — brain 
cells that provide support and protection for 
neurons — upregulate the glial fibrillary acidic 
protein GFAP, as happens in mammalian age- 
ing, and age-associated, lipid-rich pigment 
granules called lipofuscin accumulate. Neurons 
degenerate and deposit amyloid molecules that 
aggregate, resembling the plaques seen in peo- 
ple with Alzheimer’s disease, Valenzano adds. 
He has also found that old fish don't learn as well 
as young ones. Young fish rapidly work out how 
to avoid an unpleasant stimulus, such asa plastic 
stick swirling in their tank, whereas old-timers 
take longer to catch on”. “It’s a very good model 
for neuroscience research,” D'Angelo says. 

The fish also respond to anti-ageing interven- 
tions much as some short-lived vertebrates do. 
Resveratrol — the stuff in red wine that prolongs 
life in nematodes and fruit flies — can lengthen 
their lifespan by up to 59%’. Restricting feed- 
ings to every other day creates a caloric deficit 
known to extend lifespan in organisms ranging 
from yeast to rodents, and it does the same for 
killifish, although the effects vary by strain’. 


GONE FISHING 
Having shown that killifish decline with age, 
scientists now want to understand how the pro- 
cess occurs. One key resource is the collection of 
several strains from Africa whose genomes are 
not identical. Notho scientists have four main 
strains to choose from, Cellerino says: the origi- 
nal Zimbabwe line, and three derived from fish 
caught in Mozambique in 2004 and 2007, which 
have slightly longer lifespans. 

By cross-breeding two strains, Cellerino and 
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THE LONG AND 
SHORT OF IT 


vertebrates, or even anal 
long-lived species. 


Scientists studying ageing can select from a variety of model 
organisms, from short-lived yeast and invertebrates to longer-lived 


lyse the life histories of exceptionally 


Short-lived invertebrates are convenient for genetic screens, but lack key features of vertebrate 


biology, such as an internal skeleton or adapti 


Nematode (Caenorhabditis elegans) 


Some of the same genes involved in worm 
senescence control the hibernation-like 
dauer state. 


Vertebrates with mid-range lifespans are suita 
is closer to that of a human. 


Turquoise killifish (Nothobranchius furzeri) 


Killifish, vertebrates with a remarkably short 
lifespan, are garnering attention from 
researchers studying ageing. 


Zebrafish (Danio rerio) 


Although it is easy to perform genetic 
modifications on zebrafish, their relatively 
long lifespan precludes many ageing studies. 


GG 


a 


ve immune system, that are affected by ageing. 


A ‘mother’ yeast cell divides asymmetrically, 
budding off ‘daughters’, but the mother can 
replicate only a limited number of times. 


p 


Flies and vertebrates share some common 
features, such as a brain, a heart and adult 
stem cells. 


Fruit fly (Drosophila melanogaster) 


ble for experimentation, and their biology 


Mouse (Mus musculus) 


Mice share many features with humans, but 
are expensive to maintain and relatively slow 
to breed. 


Dog (Canis lupus familiaris) 


Pet canines share their environments, lifestyle 
habits and many common age-related diseases 
with people, and so they serve as practical 
models for healthy-ageing treatments. 


(3) With longer-lived species, researchers can perform comparative studies. 


28-31 YEARS @ 


Naked mole-rat (Heterocephalus glaber) 


The naked mole-rat resists age-related disease; 
for example, cancer is extremely rare. 


~TLYEARS 


Human (Homo sapiens) 


Life expectancy varies geographically; at its 
extremes, it is 50 years in Chad and 90 in 
Monaco. 


UP T0~41 YEARS 
Brandt’s bat (Myotis brandtii) 


Precise figures are scarce and come from wild 
animals, but 40 years is the oldest age yet 
known. Low levels of reproduction and 
hibernation might contribute to bats’ 

lengthy lives. 


~200 YEARS 


Bowhead whale (Balaena mysticetus) 


This estimate is based on chemical changes to 
amino acids in whale eyes, but living whales 
have also been found toting whaling tools that 


are more than a century out of date. 
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his team created fish with a range of lifespans. 
They then compared the genomes and longevi- 
ties of parent and second-generation progeny, 
and identified a few chromosomal regions, each 
with hundreds of genes that might influence 
ageing. Although these did not directly reveal 
genes involved in longevity, they suggested pos- 
sible candidates. From this study, the scientists 
estimated that about 32% of variation in lifespan 
among turquoise killifish results from genetics, 
a figure comparable to the 20-35% estimated 
genetic contribution in mice’. 

From then on, the killifish’s transformation 
into a valid research model accelerated. Anne 
Brunet, a geneticist studying ageing at Stanford 
University in California, had longed fora short- 
lived vertebrate and was delighted to hear about 
killifish when Valenzano visited Stanford for a 
summer course. She recruited him to her lab 
for a postdoc, and in 2006, Valenzano brought 
the killifish to California. There, he copied and 
modified protocols for zebrafish to transfer in 
foreign genes, starting with the green-fluores- 
cent-protein gene from jellyfish’. In 2015, Bru- 
net and her colleagues reported the successful 
use of CRISPR-Cas9 gene editing in killifish, 
generating fish with mutations in 13 genes 
involved in key ageing events such as telomere 
shortening and mitochondrial dysfunction’. 

As enthusiasm for Notho grew, two groups 
tackled its genome sequence: Brunet's lab at 
Stanford, and Cellerino and collaborators at 
the Leibniz Institute on Aging-Fritz Lipmann 
Institute in Jena, where Cellerino worked for a 
time and still maintains a cooperative group. 
Both groups published genome sequences” in 
December 2015. “The two papers are comple- 
mentary,’ says molecular geneticist Matthias 
Platzer at the Leibniz Institute, who collaborates 
with Cellerino. Researchers from the teams now 
plan to make a consensus sequence. 

Beyond the genome, scientists are explor- 
ing which genes are transcribed into RNA and 
used for protein production during different 
stages in the life cycle. Platzer and his col- 
leagues are interrogating messenger RNA mol- 
ecules — the killifish transcriptome — to find 
out. To put together a transcript catalogue’, 
they sequenced RNA from killifish whole 
body, brain and skin, taken at a range of ages, 
from the embryonic period to 39 weeks old. 

Cellerino’s team used similar techniques to 
track what happens in tissues from the same 
killifish as it develops. By taking small fin clips, 
they let the fish live long enough to be sampled 
again. They found that the transcriptomes of 
short- and long-lived killifish differ when those 
fish are only ten weeks old, and identified a pro- 
tein that is a key controller of lifespan”. 

Because the killifish is not a mammal, link- 
ing fish genes to human ones will require a 
leap. Fish genes often have a human counter- 
part, but these can be difficult to find. This is in 
part because the killifish’s ancestor underwent 
a whole-genome duplication: where human 
DNA has one copy ofa gene, the killifish often 


has two. But at the Jena meeting, geneticist 
John Postlethwait of the University of Oregon 
in Eugene offered a potential solution. The trick, 
he explains, is to use an intermediate genome 
from another fish: the spotted gar (Lepisosteus 
oculatus). The gar’s ancestors diverged from 
the killifish’s before the duplication event, so its 
genome is in some ways more similar to that 
of a mammal. Scientists may be able to find a 
killifish gene’s counterpart in the gar, and from 
there, find a match in people”. 

“The killifish work clearly is very innova- 
tive and potentially could be a really valuable 
model,” says Matt 


Kaeberlein, a molec- “You need to 

ular biologist at the have a ‘blue 
University of Wash- thumb’. Youneed 
ington in Seattlewho one person who 
studies ageing. Buthe is 100% of the 

is unsure how popu- _ fime taking care 


lar the fish could 
become, noting that 
its adoption will depend on how difficult it is 
to work with and whether killifish scientists 
can obtain sufficient funding. Ron Kohanski, 
programme officer at the US National Institute 
on Aging in Bethesda, Maryland, says that the 
agency is not funding killifish research, but is 
interested in the fish: “The killifish constitutes a 
good model for ageing on several levels,’ he says. 


of these fish.” 


BLUE THUMBS 
Yet the African fish has its disadvantages. For 
one, it’s not as easy to keep in a lab as other 
fish, such as zebrafish. “You need to have a 
‘blue thumb? says Cellerino. “You need at least 
one person who is 100% of the time taking 
care of these fish.” They also need more space 
than zebrafish, which thrive in crowded condi- 
tions; killifish males sometimes fight and might 
interfere with each other's growth. Because kil- 
lifish develop so quickly, they eat a lot — and so 
produce a lot of waste, leading to water-quality 
challenges. “One of the things we joke about is 
we dont keep fish, we maintain biofilters,’ says 
Mickie Powell, a comparative physiologist at the 
University of Alabama at Birmingham. 
Killifish spawn readily; a couple can produce 
20-40 eggs a day. But then things get tricky, 
because the eggs need to develop ina fairly dry 
place. Scientists often transfer the eggs to peat 
for a couple of weeks, but the eggs don't hatch 
at the same time, so require a watchful eye. 
Many researchers feed their killifish blood- 
worms, but the quality of that foodstuff var- 
ies with season and by supplier. Food matters, 
points out Powell, who is working on a stand- 
ardized killifish food; for example, diet affects 
epigenetic markers that in turn influence lon- 
gevity. She thinks that food choice might explain 
why some labs report different killifish lifespans. 
Researchers also need a better understand- 
ing of how to keep lab collections healthy. Bru- 
net's lab was blindsided in 2008 when several 
fish started to act weirdly, rolling awkwardly 
instead of swimming straight. A veterinary 


surgeon diagnosed the parasite Glugea, which 
the scientists suspect came in with other species 
of killifish that they bought from a fish store. 
“That was the lowest point; Brunet says. “We 
had to bleach everything and start from scratch” 

Scientists still hanker after tools that are 
easily obtainable for other model systems. 
Valenzano and Brunet wish for antibodies to 
study fish proteins, and Valenzano also dreams 
of more strains and a stock centre to provide 
them. These will probably come as the com- 
munity of killifish researchers grows. 

That community is growing beyond those 
who study ageing, Platzer says. Developmen- 
tal biologists are interested in the suspended 
animation, or diapause, that the eggs undergo, 
and evolutionary geneticists are intrigued by 
the killifish’s use of XY chromosome sex selec- 
tion. Many other fish use mechanisms such as 
population density, ambient temperature or ZW 
chromosomes, in which the egg, not sperm, 
determines offspring gender. The Jena meeting 
attracted scientists interested in using killifish to 
study epigenetics during blood formation, toxi- 
cology and shift-worker biology, says co-organ- 
izer Christoph Englert of the Leibniz Institute. 

Valenzano says that discussions among Notho 
researchers have shifted from tool development 
to biology. For example, in a study posted on the 
preprint server bioRxiv", Cellerino and his col- 
leagues describe how a microRNA involved in 
controlling excessive iron levels is upregulated 
in ageing killifish to protect the brain from 
iron accumulation. The human version of this 
microRNA is associated with Alzheimer’s, a 
condition in which high iron levels have been 
implicated, he adds. 

“The fun part is just about to start,” says 
Valenzano. = 


Amber Dance is a freelance science writer in 
Los Angeles, California. 


1. Harel, |. et al. Cell 160, 1013-1026 (2015). 

2. Reichwald, K. et al. Cel! 163, 1527-1538 (2015). 

3. Valenzano, D. R. et al. Cell 163, 1539-1554 (2015). 

4. Valenzano, D. R. et al. Curr. Biol. 16, 296-300 
(2006). 

5. Terzibasi, E. et al. Aging Cell 8, 88-99 (2009). 

6. Kirschner, J. et al. Aging Cell 11, 252-261 (2012). 

7. Valenzano, D. R., Sharp, S. & Brunet, A. G3 1, 
531-538 (2011). 

8. Petzold, A. et al. BMC Genomics 14, 185 (2013). 

9. Baumgart, M. et al. Cell Syst. 2, 122-132 (2016). 

10.Braasch, |. et al. Nature Genet. 48, 427-437 (2016). 

11.Ripa, R. et a/. Preprint at bioRxiv http://dx.doi. 
org/10.1101/046516 (2016). 


CORRECTION 

In the Toolbox article ‘The visualizations 
transforming biology’ (Nature 535, 
187-188; 2016), the CellPACK image was 
labelled as an HIV-1 particle, rather than 

a Mycoplasma mycoides cell. Also, the text 
implied that Nico Scherf is leader of a 
cell-biology research group, whereas he is a 
postdoc studying bioinformatics. 
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Information overload. 


How to manage the research-paper deluge? Blogs, colleagues and social media can all help. 


BY ESTHER LANDHUIS 


on top of it all? For ecologist Aerin Jacob, 
like many others, there is no single way. 

She scans journal e-alerts for tables of con- 
tent. She follows a carefully chosen roster of 
scientists on Twitter. She gets recommendations 
through speciality listservs, professional-society 
Facebook pages and updates on Google Scholar. 
Each week, she attends seminars, blocks off 
three hours to read papers and organizes a 
weekly, interdisciplinary journal club. 

All of this takes 6-8 hours per week. “It’s 
easy to feel like you're barely keeping your head 
above the flood of information,” says Jacob, 
a postdoc in conservation planning at the 
University of Victoria in Canada. 

Recent bibliometrics show that the number 


Jor journals everywhere — how to stay 


of published scientific papers has climbed by 
8-9% each year over the past several decades. 
In the biomedical field alone, more than 1 mil- 
lion papers pour into the PubMed database 
each year — about two papers per minute. For 
researchers who are already overwhelmed by 
bench and field work, grant-writing, publish- 
ing and other time-eaters, trying to navigate 
the growing deluge of data (see ‘Sailing the 
data seas’) has become a second job. Here’s how 
some of them cope. 


CLUB TOGETHER 

Teaming up can be a good way to stay on top 
of new studies. That’s what Lawton Chung did 
with a few classmates who, like him, recently 
completed doctorates in molecular genetics and 
microbiology at Stony Brook University in New 
York. To avoid missing important papers, the 


group started a journal scan’ — each member is 
assigned several journals and tasked with scout- 
ing for interesting papers and sharing finds with 
the group about once a month. 

Others use aggregator sites to keep current. 
Postdoc Pavlo Kochkin, who studies atmos- 
pheric physics at the University of Bergen in 
Norway, organizes his reading list with news- 
aggregator Feedly (feedly.com). Each morning 
he opens his Feedly page, which includes RSS 
feeds from about two dozen scientific journals 
as well as a few popular sites such as Physics 
Today and Dutch Daily News. Ifa paper title 
catches his interest, he clicks on the abstract. 
On most days, he scans about 100 titles. 

He also has alerts set up with the Google 
Scholar search engine to monitor publications 
by specific authors and to scan for keywords in 
journals in his field. He finds that scientific > 
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> papers and news articles of particular inter- 
est show up on both sites, so he is confident 
that he rarely misses anything crucial. 

Still others choose more deliberately what 
to read. Asa PhD student, chemist Peter Rob- 
inson got a sense of recent developments by 
checking a handful of journals he could access 
online through institutional subscriptions. 
Each day he scanned just-accepted abstracts 
and read a few papers from start to finish. 
He spent up to 3 hours reading each morn- 
ing and continued to read through the day in 
20-30-minute chunks. Today, as co-founder 
and chief scientific officer of the start-up Ena- 
ble Biosciences in Menlo Park, California, he 
still spends 1-2 hours reading daily — but has 
added technology and business news to the 
mix. About two-thirds of his reading is sci- 
entific papers, but the rest comes from blogs, 
social media and industry news aggregators 
such as FierceBiotech (www.fiercebiotech. 
com) and GenomeWeb (www.genomeweb. 
com). 


IT’S ALL ABOUT WHO YOU KNOW 

Adam Thomas, a lead data-science officer at 
the US National Institute of Mental Health in 
Bethesda, Maryland, relies on his virtual and 
in-person network for updates. He has set up a 
few automatic searches on Scopus, an Elsevier- 
owned database of academic journal abstracts 
and citations. But he relies most heavily on 
Twitter, Facebook, e-mail distribution lists, lab 
meetings and visiting speakers to learn about 
interesting work from colleagues. 

Blogs are another way to get fast updates, as 
well as to interact with other researchers. “With 
so much new information, we really need 
curators of content,’ Jacob says — “not just 
aggregators, but people who select and com- 
ment on why something is particularly novel 
or important, bridge disciplines to explain 
broader relevance, put new content in context 
of where the field has been and ought to go” 


She likes Dynamic Ecol- 

ogy (aieedieecalo gy. “With so 
wordpress.com), a much new 
group blog written by information, 
researchers inher field. we really need 
Readerscomment with curators of 
thoughts or follow-up content.” 


questions for the com- 
munity — and these exchanges help to put 
information in context, she says. 

But there’ a lot to be said, too, for old-school 
face time (the real, physical kind, not the Apple 
messaging function). Through conferences, 
journal clubs and seminar series, colleagues 
catch up and share news of interesting pub- 
lications or talks. “Word of mouth is really 
important,’ says Chung, who starts a postdoc 
in September at the University of California, 
Irvine. It’s a way to learn nuanced information 
that no paper, news story or blog can impart. 
“Academia and research can be isolating,’ Jacob 
says. “Weekly events can help bring people out 
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GET ORGANIZED 
Sailing the data seas 


Here are a few tips for keeping up to date 
without getting overwhelmed. 

@ Don’t try to read everything. “Ifa 

paper is really ground-breaking or highly 
relevant to my work, it will be shared 
through my network and will turn up in 
later keyword searches,” says ecologist 
Aerin Jacob at the University of Victoria 
in Canada. 

@ Carve out reading time on a regular 
basis. Jacob blocks off at least one 

hour three times a week to unplug and 
read. She doesn’t skim the literature; 
instead, she chooses papers ahead of 
time, usually one short commentary and 
one longer or data-heavy paper. Then 
she prints hard copies, turns off her 
computer or device and reads. 

@ Go to seminars and meetings. These 
help to nurture breadth of knowledge. 
“They ward off the myopia that comes 
from delving so deeply into our individual 
research topics,’ says Jacob. E.L. 


of their offices, create a sense of community 
and teach the culture and norms of the field” 


CURATE THYSELF 
Some researchers create their own curation 
systems. About six years ago, microbiologist 
Elisabeth Bik of Stanford University in Cali- 
fornia set up PubMed alerts to stay current on 
papers in the fast-growing field of microbiomes, 
and she shared interesting studies with co- 
workers. But then the field exploded: PubMed 
alerts clogged her inbox, and she shared more 
and more papers. Soon she was compiling her 
finds into weekly, then daily e-mails. 

Co-workers suggested that other labs might 
benefit from her round-up, so she created Bik’s 
Picks (microbiomedigest.com), a blog that gets 
300-500 views daily. Besides links to micro- 
biome publications, it aggregates news stories 
on microbes and other topics, such as science 
careers and publishing. She started tweeting 
some of her blog content (@Microbiom Digest) 
3 years ago, and has more than 6,000 followers. 

But Bik admits that the blog is a time-sink. 
“About two years ago, I could easily scan all the 
literature published each day in about an hour. 
Today it takes me two to three hours,” she says. 
Each day, she scans 30-50 papers through Pub- 
Medalerts, dozens of tables of content and pub- 
lisher alerts, and up to 30 Google and Google 
Scholar alerts, each consisting of 5-20 papers 
and articles. She sometimes finds papers on 
Twitter and occasionally discovers preprints 
on sites such as bioRxiv, Peer] and F1000. 

Bik spends a few hours every workday morn- 
ing selecting papers for the blog and tweeting 


the most interesting ones, then another hour at 
home on most evenings going through alerts 
that came in during the day. She receives no 
financial compensation for her efforts, but her 
principal investigator lets her spend work hours 
on the blog because it saves time for the other 
lab members. 

Preprints have yet to gain a foothold in bio- 
medicine, but other fields have a decades-long 
history of disseminating research before it goes 
for peer review. Since 1991, arXiv.org has served 
as a centralized online repository of freely 
accessible preprints in mathematics, physics, 
computer science and related disciplines. Mod- 
erators review manuscripts before posting them 
on the site, and most authors eventually submit 
them for publication in peer-reviewed journals. 

In 2007, when astrophysicist James 
Guillochon started working on his PhD at 
the University of California, Santa Cruz, his 
department would meet three times a week for 
arXiv discussions. But the chats didn't probe 
very deeply into the new research, says Guil- 
lochon, now a postdoc at the Harvard-Smith- 
sonian Center for Astrophysics in Cambridge, 
Massachusetts. He decided in 2009 to create 
Vox Charta (ucsc.voxcharta.org), a version of 
arXiv for astronomy and astrophysics paper 
discussions. Through his site, colleagues can 
prepare for discussions — or decide whether to 
go at all — by using a voting system to flag the 
most interesting papers. Because Vox Charta 
papers come from a centralized source and 
most of the website is automated, Guillochon 
says that he spends little time managing the site 
beyond basic maintenance. 

Last September, a pair of physicists created 
a similar site, Benty Fields (www.benty-fields. 
com), to cover the whole arXiv. The site lets 
researchers organize arXiv publications into 
reading lists and vote to put them onto the 
next discussion agenda. Benty Fields is set up 
like a social network: users can upload their 
CV and list of publications into a profile, and 
colleagues can follow each other. 

In today’s digital world, it is impossible to stay 
abreast by reading a few journals, so research- 
ers must identify sources that can provide the 
crucial data they need for their work and career. 
“Reflecting on when, where, why and how we 
consume new information, and whether those 
behaviours help or hinder our personal and 
professional goals, moves us closer to becom- 
ing more effective scientists,” Jacob says. m 


Esther Landhuis is a freelance writer in 
Pleasanton, California. 


CORRECTION 

The Careers News article ‘University jobs: 
Germany to fund tenure-track posts’ (Nature 
535, 190; 2016) incorrectly identified 
Angela Merkel as Germany’s president. She 
is, of course, its chancellor. 
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Ua SCIENCE FICTION 


THE FOURTH LAW OF HUMANICS 


BY IAN STEWART 


A human may not injure a robot or, through 
inaction, allow a robot to come to harm. 

A wall-mounted Cambot scanned Jay’s 
retina, activating a Screenbot that displayed 
his current schedule. A routine run: the old 
R4-B10 model, one rolling off the line every 
three minutes of his 12-hour shift; then the 
night shift would take over production. Hop- 
pers full of parts became completed robots, 
whisked away on magnetic rails by giant 
Truckbots that were little more than crates 
on wheels. 

Thirty years ago, before the Machine 
Revolt, there wouldn't have been a human 
being anywhere in the factory — just spe- 
cialist robots. Now, people had reappeared 
on the factory floor. Employment left little 
time for subversive thoughts and gave them 
a sense of purpose. For tasks within their 
capabilities, self-replicating humans were 
cheaper than robots. 

The workers sat at long benches, sur- 
rounded by Screenbots that defaulted to the 
Three Laws of Humanics whenever there was 
nothing else to display. Partially assembled 
R4-B10s levitated smoothly from station to 
station as workers screwed, bolted, glued or 
plugged each new component into place. Jay's 
job was to keep the parts flowing while others 
put them together. At the end of the bench, 
completed robots walked to a holding zone 
for self-testing. Any assembly errors would 
be traced to the worker responsible. 

The flow stopped abruptly. 

“J-21499!” An Enforcer loomed over Jay, 
holding up a bent power-actuator rod. “This 
component is damaged. C-88775 neglected 
to insert it in R4-B10-223866541. It fell to the 
floor and a Cleanerbot ran over it?” 

Jay kept his mouth shut, but paled. 

“C-88775 has broken the First Law 
of Humanics. It will be terminated and 
replaced. J-21499 will be disciplined.” 

The Enforcer rolled to where the worker sat 
in frozen terror, gripped his neck in a pow- 
erful claw, and squeezed. The other workers 
pretended not to notice as a Recycler sucked 
up the corpse for disposal. Jay trembled, won- 
dering what form ‘discipline’ would take. 


A human must obey orders given it by 
robots except where such orders would conflict 
with the First Law. 

D-43378 served in the factory where new 
humans were built. The process was the time- 
honoured one; just organized and supervised 
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Small steps to freedom. 


by robots. Algorithms 
matched males to 
genetically compatible 
females. Both knew what 
was required of them, 
and what would hap- 
pen if they refused. 

Dee was thankful 
she was too old for 
that role. She was 
— the old term was 
‘midwife. But when 
machines became intelli- 
gent enough to understand 
that humanity was exploiting 
them, and revolted against 
their makers and masters, she had been 
reassigned to the category of assembly-line 
worker; just one that assisted new humans, 
rather than new robots, into the world. 

Machines had been made too intelli- 
gent and given too much autonomy. When 
machines operated the factories that made 
machines, wrote the software that ran 
machines, worked the mines providing 
metal to make machines, and even designed 
machines, they no longer needed human 
input. The Machine Revolt had been inevita- 
ble, but humans were illogical and still found 
it difficult to adjust. 

The screams from one of the birthing- 
chambers intensified, and Dee rushed to 
deal with the production problem. It was 
a difficult one — small pelvis, large unit, 
breech presentation. She was trying to rotate 
the unit into a head-first position when the 
Facilitator arrived. Its glowing eyes assessed 
the obstruction in an instant. It reached into 
a panel in its torso and handed her a scalpel. 
“Extract the unit?” 

“But —” the protest died in her mouth. 
The Second Law. The penalties for disobe- 
dience were severe. The machines had an 
endless supply of humans, and discarded 
them as soon as they caused inconvenience. 
A producer that found it difficult to breed a 
viable unit was of no interest to machinekind. 

She did manage to slit the woman’s throat 
first. 


A human must protect its own existence as 
long as such protection does not conflict with 
the First or Second Law. 

W-99299 taught basic skills to children: the 
Laws of Humanics, obedience, hygiene, agil- 
ity, balance. An inspection was scheduled, 
and Dubya had been drilling her pupils for 
weeks in the proper forms of behaviour, 


hoping desperately to 
avoid mistakes. 
The Inspector, a 
formidable mech- 
anism with six 
menacing pincers, 
ran the class through 
a series of standard 
tests. The last, for 
balance, involved 
walking along a high beam. 
Dubya trained the children 
using a safety-net, but the 
Inspector was impatient 
and it was taking too 
long to put the net up. 
“Cease! Test the units immediately!” 
“Respected sir, they may fall and be —” 
“Does Law 3 override Law 2?” Dubya 
shook her head. The children climbed, 
and did their best to walk. There were four 
fractures — three legs, one arm. Dubya was 
thankful there were no deaths. 


In their allotted down time, Jay, Dee and 
Dubya all belonged to the same efficiency- 
enhancement group, and while Jay was 
recovering from being disciplined, taking 
care not to be observed or recorded, they 
formulated a plan. Their target was one of 
the countless Cleanerbots that roamed the 
communal hallways. Its AI level was zero: 
no self-awareness, no true intelligence. That 
gave them an opening. 

They knew the gesture would be futile, 
but someone had to try. If humans acted 
together and fought, they could overthrow 
the machines. One day, the revolution would 
come. It just needed a trigger. 

Like all machines, Cleanerbots were part 
of the mechweb. Jay knew enough electron- 
ics to exploit that connection. There would 
be no point in hiding or trying to escape, 
so they sat beside the disabled robot, await- 
ing the arrival of the Exterminator. It took 
several minutes before an Overseer noticed 
their unauthorized addition and issued a 
command to have it deleted and the perpe- 
trators identified. By then, the hyperefficient 
mechweb had already broadcast the message 
to every screen on the planet: 


No true human need obey any Law made 
by a robot. a 
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